Stories that captivated youth in 2021 included AI-driven approaches to predicting protein structure.


Despite challenges spurred by the ongoing pandemic, last year was a wellspring for 
scientific discovery, exploration, and innovation. From the launch of a massive new 
space station and “microflyers” inspired by maple seeds to new methods for predicting 
protein structure and manufacturing starch from carbon dioxide, and from enlightening 
discoveries about the obesity gene to revelations about the muon’s magnetism that could 
upend the Standard Model of particle physics, 2021 gave us gift upon gift of scientific 
delights. We marveled at the identification of a skull belonging to a previously unknown
species that could be an even closer ancestor to modern humans than we've ever seen 
before, and we got closer to potentially harnessing fusion energy. It’s fitting that we 
pause a moment and honor those whose tireless yearning for discovery contributed to 
our knowledge about nature's systems and structures in spectacular ways. 

For more than 30 years, Science magazine has been recognizing the intersection 
of human curiosity and the gusto of people who want to give back through 
innovation. The 2021 breakthroughs described below were collected by the News 
team at Science with suggestions from the next generation of science’s movers and 
shakers: kids. Tencent—a world-leading internet company and artificial intelligence 
(AI) leader—leveraged its social, data, and analysis services, and collected votes
from over 3.4 million young users via Tencent Docs, QQ Browser, and Sogou 
Wenwen. After fully gauging the interests of these teens, Tencent and editorial and 
news teams of Science chose the final 10 breakthrough topics. 

This year, the breakthroughs have two especially interesting themes: Each 
showcases humanity's ever-increasing capacity to tackle tough problems, while 
it also takes a significant step in shaping how we will explore the next frontier of 
the unknown. For example, when scientists utilized two AI-driven approaches to
predict protein structure—an amazing discovery given the high complexity and
large quantity of these twisty, geometric shapes—they ticked multiple boxes. By
demonstrating that AI could help solve this knotty problem, they further opened the
door to AI's expanding power as a tool for pursuing other scientific investigations.

Another 2021 breakthrough involved Chinese scientists discovering a de novo 
route for artificial starch synthesis from carbon dioxide (CO₂). Starch is the major
component of grain as well as an important industrial feedstock. In nature, it is
mainly produced by green plants fixing CO₂ through photosynthesis, and for crops
the theoretical energy conversion efficiency of this process is no more than 2%.
Since it does not rely on plant photosynthesis, this new technology requires only
carbon dioxide, hydrogen, and electricity as raw materials, saving extensive farming,
fertilizing, and processing of large quantities of crops. The novel artificial route
designed by researchers consists of only 11 core reactions to convert CO₂ into starch,
with an efficiency 8.5-fold higher than starch biosynthesis in maize. Synthesizing
starch will be a significant disruptive technology in today's world. It allows us to
see more possibilities: A shift in the agricultural mode from traditional planting
to industrial manufacturing, which helps address the pressing issues of food
production, arable land, ecology, and climate. And even further into the future, this
technology will prove to be indispensable for space colonization. 

Expanded capacity and innovative methodology have emboldened us,
engendering a renewal of the human spirit to take on the most formidable 
challenges. As scientists and engineers collaborate in harmony, prompted only 
by a dedication to excellence and a commitment to serve, we witness 
breakthroughs that help us to see the world differently. But perhaps more 
importantly, these breakthroughs enable us to see ourselves differently—as 
better, stronger, more agile, and more committed to excellence than ever before. 
This selection of annual breakthroughs is more than just a review of achievements 
in the past year. More importantly, it carries forward the spirit of science from 
generation to generation. We hope that by passing the torch of science, more 
talented young minds will be inspired to devote themselves to scientific careers 
and press ahead with a passion for seeking the truth and the audacity to 
overcome the greatest hardships. 
REWARDING HIGH-RISK RESEARCH.
SUPPORTING EARLY-CAREER SCIENTISTS.
HELPING TO FIND CURES FASTER.

APPLY TODAY


Now accepting applications for the Michelson
Philanthropies & Science Prize for Immunology.
The Michelson Philanthropies and Science Prize for
Immunology focuses on transformative research in
human immunology, with trans-disease applications
to accelerate vaccine and immunotherapeutic
discovery. This international prize supports investigators
35 and younger, who apply their expertise to research
that has a lasting impact on vaccine development and
immunotherapy. It is open to researchers from a wide
range of disciplines including computer science,
artificial intelligence/machine learning, protein
engineering, nanotechnology, genomics, parasitology
and tropical medicine, neurodegenerative diseases,
and gene editing.


“The Michelson Philanthropies & Science
Prize for Immunology will greatly impact
my future work. As I am just starting my
scientific career, it will illuminate my work,
spark interest and support me to continue
my research in this field.”


Paul Bastard, MD, PhD,
Laboratory of Human Genetics
of Infectious Diseases, Imagine
Institute (INSERM, University of
Paris), Paris, France; and The
Rockefeller University, New York.


Dr. Bastard received the inaugural
Grand Prize for his essay: “Why
do people die from COVID-19:
Autoantibodies neutralizing type
I interferons increase with age.”


Application deadline: Oct. 1, 2022. 


For more information visit: 
www.michelsonmedicalresearch.org 


GRAND PRIZE: 
$30,000 


#MichelsonPrizes 


FINALIST PRIZE: 
$10,000 
Forests produce water vapor and other compounds that promote cloud formation, helping to cool the planet.
Scanning electron micrograph of minuscule
coccolithophorid plankton cell wall coverings 
preserved as exquisite fossil impressions on 
the surface of 183-million-year-old Jurassic 
organic matter. Coccolithophores are a type 
of microscopic marine phytoplankton, and 
their hard calcareous plates, called coccoliths
(~5 micrometers long), are normally
abundant in the fossil record. Here they have 
dissolved from these 
ancient rocks deposited 
during a global warming 
interval, leaving only their 
imprint “ghosts” behind. 
See pages 795 and 853. 
Credit: Sam M. Slater, Paul 
Bown, Richard J. Twitchett, 
Silvia Danise, Vivi Vajda 
EDITORIAL


The Court is ignoring science 


A recently leaked draft opinion indicated that the
US Supreme Court is prepared to overturn Roe
v. Wade as early as next month in the matter
of Dobbs v. Jackson Women’s Health Organization.
In doing so, the Justices won’t just be disregarding
decades of precedent. They’ll also be
disregarding ample evidence of abortion’s positive
impact on patients’ health and well-being.

In Roe v. Wade (1973), the Court struck down a Texas 
law criminalizing abortion and held that the Consti- 
tution protects the right to decide whether to end a 
pregnancy. Justice Samuel Alito’s leaked opinion in 
Dobbs stems from the idea that abortion rights are not 
mentioned in the Constitution or rooted in US history. 
But we, as a society, are equipped with more factual 
information today than the framers of the Constitu- 
tion were. There is a long history of Supreme Court 
abortion decisions drawing on evi- 
dence. In Whole Woman’s Health 
v. Hellerstedt (2016), the Court 
emphasized the importance of 
considering data when reviewing 
abortion restrictions. It is critical 
now that the Court adhere to prec- 
edent and insist that constitutional 
rights be guided by evidence, not 
by ideology. 

The Turnaway Study, which I
led, was designed to contribute to
that body of evidence. More than
40 scientists from nine universities
and four research institutes have worked
on the study since its launch in 2007. We were inspired to do
so by Justice Anthony Kennedy, who raised the question
of an abortion’s effect on a pregnant woman in
Gonzales v. Carhart (2007), a case that considered
the constitutionality of a federal ban on one abortion
procedure. In his majority opinion, Justice Kennedy
argued that there were “no reliable data to measure
the phenomenon” of potential abortion regret, and the
Court thus upheld the law.

My colleagues and I resolved to find that data—so 
that Supreme Court Justices and policy-makers could 
base their decisions not on conjecture, but on hard sci- 
ence. To that end, the Turnaway Study followed almost 
1000 women across the country for 5 years to see how 
having an abortion, or being denied one, affected their 
physical health, mental health, finances, relationships, 
and children. 

What we found was staggering. 


* https://www.ansirh.org/sites/default/files/publications/files/turnawaystudyannotatedbibliography.pdf

The research revealed that patients who were able 
to receive an abortion were more than six times more 
likely to report aspirational 1-year plans than those 
who were denied one. They are more likely to have 
a wanted child later and better able to take care of 
the children they already have. Because the majority of 
abortion patients are already parents, this means that 
being able to obtain an abortion has powerful, multi- 
generational impacts. 

By contrast, if people are forced to carry a pregnancy 
to term, they are more likely to experience lasting finan- 
cial hardships. After being denied an abortion, women 
had three times greater odds of being unemployed 
than those who obtained abortions and had four times 
higher odds of being below the federal poverty level. 

Their physical and mental health are also at risk: 
Women unable to obtain an abortion said they had 
more symptoms of anxiety, lower 
self-esteem, and lower life satis- 
faction. They were more likely to 
report “fair or poor” health than 
those who had received abortions. 
And, again, their families feel the 
effects: Patients report more diffi- 
culty bonding with their baby, and 
their older children have worse 
developmental outcomes and are 
more likely to live in poverty. 

The clearest finding from the 
Turnaway Study is that people 
know what is best for themselves 
and their families. The results are 
not theories or guesses. They are not anecdotes to be 
trotted out at politically expedient moments. They are 
based on a large set of data in which statistically sig- 
nificant patterns emerged across many people. And 
they are the basis for more than 50 articles that have 
been published in high-quality, peer-reviewed scien- 
tific journals*. 

As a researcher, I believe that science should not 
be merely an ivory-tower pursuit, divorced from the 
world around it. Understanding the nature, causes, 
and solutions to human problems is one of the primary 
goals and great gifts of science. Science is clearly rel- 
evant to the controversial issues of our time, including 
abortion access in the United States—in fact, science is 
especially critical in these moments. The highest court 
in the United States should not ignore it. 


—Diana Greene Foster
“We don't think that [China's zero COVID-19 policy]
is sustainable, considering the behavior of the virus.”


Tedros Adhanom Ghebreyesus, head of the World Health Organization, referring 
to strict lockdowns to prevent spread. China called the remark “irresponsible” and censored it. 


Edited by 
Jeffrey Brainard 


A worker disinfects a store in Pyongyang, North Korea, as an illness believed to be COVID-19 spreads.


COVID-19 


A ‘fever’ hits unprotected North Korea


Aid agencies are gearing up for a robust response against what
North Korea calls a “malignant fever” sweeping the country that
is almost certainly COVID-19. As Science went to press, new
cases of the illness appeared to be doubling every 2 days in the
world’s last large population, some 25 million people, lacking
immunity to the disease from vaccination or infection. “We’re
looking at a major, major catastrophe,” says Kee Park, a global health
specialist at Harvard Medical School. Early in the pandemic, North
Korea closed itself off from the world in a bid to keep the virus at bay.
That strategy largely succeeded for 2 years, despite occasional reports
of possible outbreaks. Until last month, North Korea had tested just
66,000 people for COVID-19 and reported zero cases. It also ignored
offers of COVID-19 vaccines. But the “great upheaval,” as North Korea’s
leader Kim Jong Un calls the outbreak, may persuade the reclusive
nation to accept such assistance as vaccines, antivirals, and rapid diagnostic
tests. “We need to fast-track the aid,” Park says.
Shots cut risks of Long Covid


PUBLIC HEALTH | A study of more than
1 million Americans has found vaccination
is linked to substantially lower risk
of lingering symptoms after COVID-19
infections, often called Long Covid.
The result, published on 7 May in Open
Forum Infectious Diseases, confirms
similar findings from smaller studies.
Researchers at Case Western Reserve
University used electronic medical
records to compare symptoms in vaccinated
and unvaccinated people
before and 3 months after an infection,
both before and after vaccines became
available. The data include about
1.5 million unvaccinated U.S. residents
and 25,000 who had been vaccinated.
All participants had infections confirmed 
by polymerase chain reaction testing. The 
researchers looked for a range of new 
health concerns after infection, such as 
diabetes, heart disease, and hypertension. 
They also examined the persistence of 
symptoms associated with COVID-19, 
such as fatigue and body aches. For each 
symptom or diagnosis studied, rates 
were lower in the vaccinated cohort. For 
example, rates of persistent headaches 
and body aches in the vaccinated group 
were roughly half. 


Antidepressant nixed as COVID Rx 


DRUG DEVELOPMENT | The U.S. Food and 
Drug Administration (FDA) decided this 
week not to approve the antidepressant 
fluvoxamine to treat COVID-19. 
Fluvoxamine is a selective serotonin 
reuptake inhibitor typically prescribed 
for depression or obsessive-compulsive 
disorder. In October 2021, a randomized, 
controlled trial in Brazil showed unvac- 
cinated people with preexisting conditions 
who took the drug were 32% less likely 
than those who took a placebo to be 
hospitalized or need prolonged medical 
care. But FDA said proof of the drug’s 
effectiveness was “not persuasive.” Other 
treatments for COVID-19, including the 
oral antiviral pills Paxlovid and molnupi- 
ravir, have been available since October 
2021 and have proved to be more effective 
than fluvoxamine. 
A 1380-ton section of
the ITER fusion reactor
is one of nine that
will be welded together.


Leader who revamped fusion project dies 


Bernard Bigot, who as director-general of the giant ITER
fusion test reactor under construction in France was
credited with turning around the behind-schedule and
overbudget project, died last week at age 72 after an
illness. A theoretical chemist, Bigot took control of the
world’s biggest science project in 2015, as some politicians in
the United States were pushing to withdraw from the seven-member
partnership. Bigot transformed ITER from an industrial
stimulus program to a bona fide scientific experiment, observers
say. In fact, the parts for the ITER reactor are now more than
75% complete. Last week, ITER announced the installation of
the first of nine segments of its 11-meter-tall, doughnut-shaped
vacuum chamber—although further work is paused as regulators
look into certain safety issues. When it starts up in 2025,
the facility, costing more than €17 billion, aims to fuse hydrogen
nuclei to form helium, ultimately yielding 10 times more energy
than is needed to produce the plasma. Bigot will be succeeded
on an interim basis by Deputy Director-General Eisuke Tada.


Tooth puts Denisovans in Laos


ARCHAEOLOGY | A molar recovered from
a cave in northeastern Laos in 2018 closely
resembles the few known examples of
molars of the mysterious extinct human
species known as Denisovans, close cousins
of the Neanderthals. Previous Denisovan
fossils have been found in Siberia and Tibet.
The new finding, from Southeast Asia,
greatly expands Denisovans’ known range
and suggests they occupied several ecologically
diverse landscapes. The research
team dated the single, human-looking
molar to approximately 130,000 to 160,000
years ago. Although they couldn’t extract
DNA, they managed to identify strands of
ancient proteins within the enamel that
confirmed it was from a member of our
genus, Homo, and was most likely female,
the team reports this week in Nature
Communications. Some Denisovan DNA
persists in modern human populations, particularly
in people from Southeast Asia.


China probes academic database


PUBLISHING | China’s market regulation
agency last week announced it is investigating
whether the country’s largest
academic database engages in monopolistic
behavior. China National Knowledge
Infrastructure (CNKI), launched in 1996
by Tsinghua University and a state-owned
software firm, provides access to the
full text of 280 million papers and dissertations
in both Chinese and English
collected from more than 90% of the
country’s academic journals. Little of this
material is found in other databases. In
recent years, CNKI’s aggressive accumulation
of papers has led to copyright
disputes, and the database has faced
complaints about fees. Last month, the
Chinese Academy of Sciences refused to
pay the reported 10 million yuan ($1.5
million) annual charge and dropped the
service, preventing access to the database
by its more than 100 research institutes.


Energy science office head OK’d 


LEADERSHIP | The U.S. Senate last week 
confirmed a soil scientist as chief of the 
U.S. Department of Energy’s Office of 
Science, the nation’s single largest funder 
of the physical sciences. Asmeret Asefaw 
Berhe of the University of California 
(UC), Merced, studies how soil absorbs 
atmospheric carbon dioxide. Some physi- 
cists complained that Berhe, who served 
as interim associate dean of UC Merced’s 
graduate division, lacks leadership 
experience and relevant scientific back- 
ground to lead an office that spends the 
large majority of its $7.48 billion budget 
on physics research. But some previ- 

ous directors of the office argued the 
criticism was off base, and many climate 
researchers cheered her selection. The 
Senate confirmed her by a 54-45 vote, 
with just four Republicans voting for her. 
Berhe becomes the first Black person to 
lead the office. 
‘COVID-ization’ of research levels off


Researchers who pivoted from distant disciplines produced lower impact work, study finds 


By Jeffrey Brainard 


In early 2020, as the COVID-19 pandemic
took hold, a lab run by virologist
John Schoggins at the University of
Texas (UT) Southwestern Medical Center
became one of many around the
world to shift its full attention to the
crisis. He and his seven-person lab offered
help to other scientists and physicians—by
testing human saliva for the SARS-CoV-2
virus, for example—and also produced a
handful of pandemic-related research papers.
But in recent months, as vaccines
and treatments helped reduce the
pandemic’s severity, members of the
lab have turned away from working
on COVID-19. With so many
researchers piling into the field,
Schoggins says, “there was a sense
of saturation.” As a result, his Ph.D.
candidates began looking elsewhere
for promising dissertation topics.

Schoggins’s lab is part of a wider
pivot away from COVID-19 research,
recent analyses of publishing trends
suggest. Overall, the number of pandemic-related
papers appears set to
decline this year, after explosive and
unprecedented growth in 2020 and
2021 (see graph, p. 783). In key disciplines
such as infectious diseases
and public health, the proportion
of new papers devoted to COVID-19
appears to be flattening out (see
table, below). And in fields more distant from
pandemic science, the share of COVID-19 papers
is declining, suggesting researchers are
returning to their core interests.


Growth chart


The rise in COVID-19 papers has slowed in some fields and peaked in others.
Data for this year are through mid-April only. Percentages are rounded.


% OF PAPERS ABOUT COVID-19

FIELD                                            2019    2020    2021    2022    CHANGE IN PERCENTAGE POINTS, 2021-22
Virology                                         3.1%    17.4%   28.4%   37.1%    8.8
Infectious diseases                              0.8%    13.2%   23%     23.8%    0.9
Public, environmental, and occupational health   0.2%    7.8%    17.0%   17.5%    0.5
Emergency medicine                               0.0%    7.8%    11.4%   8.0%    -3.3
Medical informatics                              0.0%    6.3%    13.8%   11.5%   -2.3
Respiratory system                               0.1%    6.3%    10.1%   9.5%    -0.6
All science fields                               0.0%    1.7%    3.7%    3.9%     0.3


Those developments are not surprising,
observers say. “Now that many of the clinical
and epidemiological knowledge gaps
have been [filled], the research focus of most
clinicians and epidemiologists is rightfully
moving back towards their own specialist
interests,” says Alimuddin Zumla, an infectious
diseases researcher at University College
London, who was not involved in the
analyses. That’s not necessarily a bad thing,
he adds: “There are many other priority
killer [infectious] diseases which have been
neglected during the COVID-19 era that need
attention.” Other evidence suggests that in
some fields, researchers who plunged into
COVID-19 studies didn’t always produce their
best work.

The pandemic prompted a massive influx
of scientists into related research. As
of April, more than 500,000 pandemic-related
journal articles and preprints had
appeared, according to an analysis of the
Dimensions bibliographic database
by Philip Shapira of the University
of Manchester, who studies industrial
innovation. Although those
publications make up just 4% or
so of all scientific papers published
from 2019 through early this year,
the surge of papers on a new topic
was unmatched in the history of science.
In certain disciplines the shift
was especially dramatic. Shapira’s
analysis—presented in an April
bioRxiv preprint—shows that in
virology, the share of papers focused
on coronaviruses and the diseases
they cause went from roughly 3% in
2019 to 28% in 2021, and in infectious
diseases the share rose from
less than 1% to 23%.

Such numbers have raised concerns
about what some scientists
call the COVID-ization of research.
They fear too many researchers rushed into
work outside of their expertise, resulting in 
poor-quality studies. 

One recent analysis suggests such fears are 
not unfounded. Two-thirds of authors who 
had at least one publication on COVID-19 
in 2020 had no previous papers on a related 
topic, Dashun Wang of Northwestern Univer- 
sity and colleagues reported in an arXiv pre- 
print in July 2021. 

In addition, by using a metric it devel- 
oped, the team found that papers published 
on COVID-19 in 2020 had lower impact 
on average than non-COVID-19 papers 
published during the same year. Using a 
different metric that measured a paper’s 
novelty, the group found that the further 
a researcher had pivoted from their usual 
area of expertise, the lower the impact of 
their COVID-19 publications. 

The lower impact of COVID-19 publica- 
tions—and the corresponding lower profes- 
sional rewards—could be one reason some 
researchers are retreating. Overall, Shapira’s 
analysis found that, in most of the roughly 
230 scientific fields he studied, the share of 
papers focused on COVID-19 continued to in- 
crease through early this year, but at slower 
rates compared with 2021. But in about 
70 fields—including emergency medicine, 
pharmacology, and the study of the respira- 
tory system—the share is dropping. 

Another likely cause of the shifts: Journal
editors have become more choosey,
says Jasper Fuk-Woo Chan, a virologist at
the University of Hong Kong. He helps edit
several journals and receives a steady flow
of requests to review COVID-19 papers for
others. “It’s getting more difficult to publish
[COVID-19 articles] in good journals,” he says,
with editors “expecting more details, more
robust data, than in the first 1 to 2 years” of
the pandemic. At that time, he says, journals
published many duplicative “me too” papers.


Topping out?

The number of publications on COVID-19 is forecast
to decrease this year, although data lags might affect
totals. (The total in all fields is also set to drop.)

The easing of the COVID-19 publishing 
frenzy doesn’t diminish the value of the 
work done by scientists who jumped into 
COVID-19 research—and of the experience 
they gained. “Not everything paid off,”
Shapira says. But the influx “probably 
trained a whole set of researchers to think 
about pandemics from medical, public 
health, and other aspects. There’s been a 
human capital investment.” 

For early-career researchers, however, the 
consequences of the massive pivot could be 
mixed. For some, the surge provided funding 
and publishing opportunities that could help 
their careers. But it may not be easy for them 
to adapt and prosper as funding and enthu- 
siasm for COVID-19 research wane. 

For example, when the UT Southwestern 
Medical Center recently hired a number of 
assistant professors, close to half of the ap- 
plicants had shifted into COVID-19 research 
and some had co-authored high-profile stud- 
ies, says microbiologist Julie Pfeiffer, a mem- 
ber of the search panel. That raised some red 
flags, says Schoggins, who also served on the 
panel. “We were concerned about their future 
fundability and ability to make an impact,” 
he says. “When they leave that incubator of 
high-profile [COVID-19] work, can they start 
their own lab and be independent?” 

Future fundability might be of less con- 
cern for senior researchers who have stud- 
ied COVID-19. In 2020 these older scientists 
were more likely than their younger peers to 
pivot into such work, Wang’s literature anal- 
ysis found. And the papers often reported no 
grant support, Wang notes, leading him to 
speculate that senior scientists had the fi- 
nancial wherewithal to jump into new lines 
of research. 

Discerning how many of the research- 
ers who rushed into pandemic research 
ultimately remain could take several more 
years. But Seema Lakdawala, a virologist at 
the University of Pittsburgh, is one scien- 
tist who plans to stay the course. Before the 
pandemic, she studied influenza viruses and 
their airborne transmission, so investigating 
that aspect of SARS-CoV-2 became a natural 
fit for her group, she says, and she expects 
to continue studies on both types of viruses. 

She is less concerned about the pivot away 
from COVID-19 by some researchers than 
about whether funding for such work, cru- 
cial to coping with future pandemics, will be 
sustained. “What I worry about is [whether] 
after this pandemic and its fatigue are over 
... we'll have the investment that we need in 
all aspects of virology,” Lakdawala says. “We
really need to be prepared for another even- 
tual threat. It’s not hard to believe that this is 
going to happen again.” 


COVID-19 


Do you hate your face mask? There’s hope


A U.S. government contest has 10 companies competing to make better face coverings


By Jon Cohen, in Boulder, Colorado 


In the shipping room of his factory here,
Richard Gordon pulls open the drawer
of a restaurant-style convection oven
to show off a tray filled with his company’s
new, freshly sterilized product:
multicolored face masks that feature
an origami design.

“I thought masks were a total horror,”
Gordon says. “They looked awful, felt awful,
were hard to breathe in, were hot,
and leaked.” So he and Min Xiao, his wife,
started a company named Air99 in 2016 to
produce something much better.

Now, their mask, named the Airgami, is 
vying for part of the half-million dollar purse 
in the final phase of the Mask Innovation 
Challenge, run by the U.S. Biomedical Ad- 
vanced Research and Development Author- 
ity (BARDA). The contest aims to promote 
masks that have a better fit, function, and 
look than existing designs and to nurture 
the “rather underfunded and a little stag- 
nant” ecosystem of mask development, says 
Kumiko Lippold, a BARDA pharmacologist 
and toxicologist who organizes the challenge. 

Lippold acknowledges the contest may
seem “a little bit behind the curve,” given
that the pandemic has abated and many
countries have dropped masking requirements.
Still, there’s “a significant appetite for
mask innovation,” she says. SARS-CoV-2 may
have surprises in store that will require people
to mask up again—and there will likely
be other pandemics. “We're building tomorrow’s
mask,” Lippold says.

The 10 finalists, selected from 1448 en- 
trants, include mom-and-pop innovators like 
Air99, a team at Georgetown University, and 
industrial giants Amazon and Levi Strauss 
& Co. To evaluate the masks, BARDA has 
teamed up with the National Institute for 
Occupational Safety and Health (NIOSH)— 
which tests and approves N95
“respirators,” the type that snugly 
fit the face and have a high filtra- 
tion efficiency—and the National 
Institute of Standards and Tech- 
nology (NIST). BARDA plans to 
announce a winner in October. 

With billions of people don- 
ning face masks for the first time 
in 2020—and complaining about 
their shortcomings—the pan- 
demic has triggered a surge in 
mask research. In a study pub- 
lished in June 2021, for example, 
NIOSH engineer and aerosol re- 
searcher William Lindsley and col- 
leagues compared 19 widely used 
face coverings by attaching them 
to a respiratory aerosol simula- 
tor, a mannequin headform that 
breathes and coughs. 

All masks help, Lindsley
stresses: “The two biggest misconceptions
are that they don’t
work and that they’re magic
and you're protected, no matter
what.” But bandanas are “terrible”
at both filtering inhaled
air and capturing aerosols when 
people exhale and cough, the 
study found. Cloths, neck gaiters, 
and medical masks are much bet- 
ter, but still pale in comparison to NIOSH- 
approved N95 masks. (In Europe, the equiv- 
alent of N95s are known as FFP2 masks.) 

The finalists in the BARDA challenge 
each offer unique improvements. In Air- 
gami’s case, beauty is important, says 
Gordon, an electrical and computer 
engineer—but that’s not why he and Xiao 
entered the field. Their quest began well 
before the pandemic, when they moved to 
Suzhou, China, in 2011 for Xiao’s new job. 
Pollution there was horrible, and the N95 
masks they had brought, made by 3M for 
construction workers, didn’t fit their young 
son. “I immediately started cutting out 
3M masks and gluing them and stapling 
them just to shrink them down to fit him,” 
Gordon says. “Gotta give [the] kid clean 
air. Very, very simple,” he says. 

His son’s problem, coupled with the dis- 
comfort and fit issues he had with his own 
face coverings, led Gordon to devote him- 
self to designing a better mask after the 
family moved back to the United States in 
2015. He stumbled onto an origami show, 
Above the Fold, that had a “mind-blowing” 
piece by physicist Robert Lang, a world- 
renowned origami mathematician and 
artist. The Airgami is a twist on a popu- 
lar origami design, the magic ball—also 
known as the dragon’s egg—sliced in half, 
which creates a large breathing space and 
fits tightly on the face. The inner of three 
polypropylene layers has an electrostatic 
charge to trap particles—the heart of N95 
technology. The mask is reusable, can be 
rinsed or disinfected with heat, and comes 
in four different sizes and various color- 
ful prints, including rainbows and cam- 
ouflage. Lang, who is now on the Air99 
board, helped Gordon create a computer 
program to automate the creasing. 

The Airgami is one of 10 finalists in a competition for next-generation masks. 
“I thought masks were a total horror,” says Richard Gordon, its developer. 

Even so, each mask must be hand- 
assembled and sells for $29.99. But 
Gordon says supply can’t meet demand. 
“The world is flooded with $1.50 masks, 
and there’s no way we’re going to compete, 
but they’re all ugly and they don’t neces- 
sarily fit great,” says Gordon, who hopes to 
lower the price with more automation. 

Another finalist, Amazon’s PerfectFit 
Mask, also uses an origami design and 
comes in various fashion patterns and sizes. 
A company named 4C Air, co-founded by 
physicist and Nobel laureate Steven Chu, 
makes the BreSafe Transparent Mask, which 
aims to improve masked conversations 
by allowing a listener to see the speaker’s 
lips. The hard-shell AtmoBlue mask, made 
by Blue Sky Labs, has built-in fans that 
blow incoming air across high-efficiency 
particulate absorbing filters and a sensor 
that monitors air quality for pollution. The 
Georgetown group developed nanoporous 
metal foams that are extraordi- 
narily efficient filters, ultralight- 
weight, and also reusable. 

Levi Strauss took a differ- 
ent tack: Its mask, the Veil, of- 
fers N95-level protection with a 
simple design that any garment 
manufacturer can produce with 
scissors and a sewing machine— 
and sports the brand’s world- 
famous logo for added coolness. 
There is even a face covering 
especially for toddlers made by 
PaciMask that allows parents 
to attach a pacifier and features 
cartoon characters, animals, and 
spaceships. (Its slogan: “It’s just 
a mask, baby!”) 

NIOSH’s standard N95 test as- 
sesses masks’ filtration efficiency 
by exposing them to aerosolized so- 
dium chloride and measuring the 
amount that passes through. An 
N95 rating means a mask filters at 
least 95% of “nonoily” (hence the 
N) particles. But for the BARDA 
challenge, NIOSH devised addi- 
tional tests. “We got to really think 
about the testing innovation,” 
says physical chemist Sandeep 
Patel, who heads the BARDA di- 
vision overseeing the challenge. 
Recognizing that mask fit depends on face 
shape, for example, researchers designed 
five different-size mannequin headforms, 
based on the faces of nearly 4000 people. 
The mask challenge encourages entrants to 
supply analyses of how their masks fit digital 
versions of all five. 
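
The pass/fail logic behind a filtration rating like the one described above can be sketched in a few lines. This is an illustration only, not NIOSH's actual procedure: the function names and the particle readings are hypothetical, and the only figure taken from the text is the 95% threshold.

```python
def filtration_efficiency(upstream: float, downstream: float) -> float:
    """Fraction of the challenge aerosol removed by the filter.

    upstream   -- aerosol concentration reaching the mask (any unit)
    downstream -- concentration that passes through (same unit)
    """
    if upstream <= 0:
        raise ValueError("upstream concentration must be positive")
    if not 0 <= downstream <= upstream:
        raise ValueError("downstream must be between 0 and upstream")
    return 1.0 - downstream / upstream


def meets_n95_threshold(upstream: float, downstream: float) -> bool:
    """True if at least 95% of the particles are filtered out."""
    return filtration_efficiency(upstream, downstream) >= 0.95


# Hypothetical readings, for illustration only.
print(meets_n95_threshold(1000.0, 30.0))  # 97% of particles filtered
print(meets_n95_threshold(1000.0, 80.0))  # 92% of particles filtered
```

A filter score of this kind says nothing about leakage around the edges of a mask, which is why the fit and leakage tests described here are measured separately.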

NIST fluid dynamicist Matthew 
Staymates will also test the finalists for 
leakage with schlieren imaging, which uses 
lenses and mirrors to visualize changes in 
air temperature. Staymates couples this 
with high-speed video cameras, which al- 
lows him to capture air escaping from the 
edges of masks when people breathe. The 
major challenge confronting maskmakers 
isn’t new materials, but design, Staymates 
says. “We can make fabrics that have amaz- 
ing filtration efficiency, and the N95 is a 
great example,” he says, but “how can we 
get smart about designing shapes that can 
really seal well so my glasses don’t fog up?” 

Gordon and Xiao see a bright future for 
their company even after the pandemic 
ends. “We started out as an antipollution 
mask, and I think it is still the core busi- 
ness,” Xiao says. Still, COVID-19—and be- 
coming a BARDA finalist—has given the 
company a boost they never imagined, 
Gordon says: “The pandemic was the 
greatest marketing awareness campaign of 
all history.” 
ASTRONOMY 


Shadow of Milky Way’s giant black hole revealed 


Scene resembles earlier image from a distant galaxy, as Einstein's theory predicts 


By Daniel Clery 


The first image of the supermassive 
black hole at the center of the Milky 
Way Galaxy—or at least of its shadow— 
transfixed astronomers last week. 
Eight radio observatories around the 
globe and more than 300 scientists 
joined forces to image the object known as 
Sagittarius A* (Sgr A*), a feat thought impos- 
sible until just a few years ago. “It’s a truly im- 
pressive accomplishment,” black hole expert 
Roger Blandford of Stanford University says 
of the work, presented at press conferences 
on 12 May. 

The same team, known as the 
Event Horizon Telescope (EHT), in 
2019 produced the first ever image 
of a black hole, at the center of the 
nearby giant galaxy M87. The M87 
black hole is 1600 times the mass of 
Sgr A*. Yet the similarity of the two 
images—bright rings of gas trapped 
in death spirals around these ulti- 
mate sinkholes—shows how Albert 
Einstein’s theory of gravity, gen- 
eral relativity, works the same at all 
scales. “In both cases, what we see is 
the heart of the black hole, the point 
of no return,” team member Feryal 
Ozel of the University of Arizona said. 

Compared with M87, which con- 
verts swirling gas into a powerful 
jet thousands of light-years long, 
Sgr A* is quiet. “M87 was exciting 
because it was extraordinary,” says 
Michael Johnson of the Center for Astrophysics | Harvard & Smithsonian (CfA). 
“Sgr A* is exciting because it’s common.” Initial 
analysis of the new image suggests 
only a trickle of gas makes it to the black 
hole, and only one part in 1000 is being 
converted to light, Johnson says. Compar- 
ing the image with computer simulations 
points to another feature of Sgr A*, Ozel 
says. “We may be seeing the first hints” that 
the black hole is spinning. 

Although the black holes in the centers of 
galaxies have enormous masses—millions 
or billions of times that of the Sun—their 
intense gravity means their outer edge, the 
event horizon, is tiny in galactic terms. Sgr 
A*, which has a mass of 4 million Suns, has 
an event horizon that is only about 17 times the 
Sun’s diameter. Imaging something so small 
from 27,000 light-years away presents a 
huge challenge for astronomers. 
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Only radio telescopes can peer through 
the clouds of dust around the galactic cen- 
ter, but their long wavelengths offer poor 
resolution. The shortest radio wavelengths, 
of about 1 millimeter, can do better, and 
combining multiple telescopes can further 
sharpen the image. EHT observes the ga- 
lactic center simultaneously with dishes 
spread across Earth’s face, then processes 
the data with banks of computers as if each 
dish was a small patch of an Earth-wide 
telescope—a technique known as very long 
baseline interferometry (VLBI). “Each pair 
[of telescopes] contributes a little bit of in- 
formation to the entire image,” says EHT 
team member Katie Bouman of the Califor- 
nia Institute of Technology. 

A ring of glowing gas surrounds a void that encloses 
the Milky Way’s 4-million-solar-mass black hole. Astronomers don’t 
know whether the bright knots are real or artifacts. 

By 2017, the team was ready to take a 
shot at Sgr A* and the nearby giant galaxy 
M87. Data came from eight observatories, 
from Hawaii to Spain and from Arizona to 
the South Pole. A key addition was the Ata- 
cama Large Millimeter/submillimeter Array 
in Chile, a group of 64 dishes with the com- 
bined area of an 84-meter-wide telescope. 

In April 2019, the team released its now 
famous image of M87, a result chosen as 
Science's 2019 Breakthrough of the Year 
(Science, 20 December 2019, p. 1434). It 
shows the event horizon’s silhouette, magni- 
fied 2.5 times by gravitational effects. Some 
of the light in the surrounding ring was actu- 
ally emitted behind the black hole, its path 
bent by the intense gravity so that it appears 
to come from the edge. 

Getting an image for Sgr A* proved to be 
much harder. One reason was that the tele- 
scopes were viewing it through the crowded 
central plane of the Milky Way, where elec- 
trons from ionized gases scatter the ra- 
dio waves. Johnson describes it as peering 
through “frosted glass.” Another challenge 
was motion. Gas moves slowly around M87’s 
giant black hole, taking days to orbit the 
event horizon. But around the much smaller 
Sgr A*, an orbit takes minutes. “If an object 
changes in a crazy way, you can’t image it 
with VLBI,” says EHT team member 
Heino Falcke of Radboud University. 
But he says that after 2 years scruti- 
nizing the results, the team is confi- 
dent it has been able to extract the 
steady part of the signal. “On top of 
the chaotic structure you have a sta- 
ble structure,” he says. 
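
The contrast between “days” and “minutes” follows from the fact that orbital periods near a black hole scale in proportion to its mass. A rough sketch, assuming a nonrotating black hole and gas on the innermost stable circular orbit at 6GM/c² (both simplifying assumptions not stated in the article), with the M87 mass taken as 1600 times that of Sgr A*:

```python
import math

G_M_SUN_OVER_C3 = 4.925e-6  # seconds; G * (one solar mass) / c**3


def isco_period_seconds(mass_in_suns: float) -> float:
    """Orbital period at r = 6GM/c^2 around a nonrotating black hole,
    as timed by a distant observer: T = 2*pi * 6**1.5 * GM/c**3."""
    return 2 * math.pi * 6 ** 1.5 * G_M_SUN_OVER_C3 * mass_in_suns


sgr_a_mass = 4e6               # solar masses, from the article
m87_mass = 1600 * sgr_a_mass   # mass ratio quoted in the article

print(f"Sgr A*: {isco_period_seconds(sgr_a_mass) / 60:.0f} minutes")
print(f"M87:    {isco_period_seconds(m87_mass) / 86400:.0f} days")
```

Because the period is proportional to mass, the two results differ by the same factor of 1600. A spinning black hole or a different orbital radius would shift both numbers.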

Unlike for M87, the mass of Sgr A* 
is known very precisely from studies 
of star orbits close to the black hole, 
so the EHT researchers had a firm 
idea of how big of a ring they should 
see. The ring they found was “bang 
on,” says EHT team member Dom 
Pesce of CfA. But the three “knots” 
visible in the image might be artifacts 
of the observation process, Ozel says. 

The EHT team carried out further 
observing campaigns of M87 and Sgr 
A* in 2018, 2021, and 2022. “All data 
are in various stages of calibration 
and processing,” Pesce says. The ac- 
cumulating data could help research- 
ers understand how matter moves around 
a black hole and is sometimes funneled 
into jets. The EHT array has also expanded 
since 2017, adding new dishes in Greenland, 
France, and the United States, with plans for 
another in Namibia. 

In the future, the researchers plan to ob- 
serve at a shorter wavelength—0.86 milli- 
meters, compared with the 1.3 millimeters 
used so far—which will allow them to see 
even closer to the event horizon. They will 
also attempt time-lapse movies of the gas 
swirling around the black holes. Observing 
M87 every 2 weeks is the first goal. Later, 
they will try for Sgr A*-The Movie. “We're still 
in the infancy of this field,” Pesce says. “The 
first baby steps.” 
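
The benefit of the shorter wavelength mentioned above can be estimated with the textbook diffraction limit, θ ≈ λ/D, where D is the longest baseline. The sketch below assumes a baseline equal to Earth's mean diameter (about 12,742 kilometers), which is an upper bound rather than the array's actual geometry. The function name is illustrative.

```python
import math

EARTH_DIAMETER_M = 1.2742e7                      # mean diameter, meters
RAD_TO_MICROARCSEC = 180 / math.pi * 3600 * 1e6  # radians -> microarcseconds


def angular_resolution_uas(wavelength_m: float,
                           baseline_m: float = EARTH_DIAMETER_M) -> float:
    """Diffraction-limited resolution, theta ~ lambda / D, in microarcseconds."""
    return wavelength_m / baseline_m * RAD_TO_MICROARCSEC


for wavelength_mm in (1.3, 0.86):
    theta = angular_resolution_uas(wavelength_mm * 1e-3)
    print(f"{wavelength_mm} mm -> about {theta:.0f} microarcseconds")
```

Under these assumptions, moving from 1.3 to 0.86 millimeters sharpens the resolution by the ratio of the wavelengths, roughly a factor of 1.5.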
Dust storms like the one that engulfed cities in southeastern Brazil in 2021 could become more frequent as dryland soils retain less moisture. 


Soils’ coating helps keep dust storms at bay 


Breakdown of “biocrusts” allows wind to attack dryland soils 


By Elizabeth Pennisi 


Two years ago, 24 million tons of dust 
lifted high above Africa to create a 
plume that swirled across the Atlan- 
tic Ocean and covered Puerto Rico 
in a pink patina. An unusually me- 
andering jet stream helped launch 
the monster dust cloud. But this week, 
ecologists fingered another factor in such 
storms: the disappearance of bio- 
crust, a microbial mat that coats 
dryland soil and helps keep dust in 
place. “It glues the sand together,” 
says Bettina Weber, an ecologist 
at the University of Graz and co- 
author on the new study. 
Trampled by livestock and 
scorched by climate change, that 
glue is likely to become weaker in 
the future, researchers say, leav- 
ing the soil prey to wind. The 
dust study “demonstrates that the 
loss of biocrust in one area of the 
world can have wide-ranging and 
far-flung impacts,” says Rebecca 
Finger-Higgens, an ecologist at the 
U.S. Geological Survey. 
A biocrust is a hard surface 
coating or “skin,” typically a few 
millimeters thick, containing a thriving 
community of fungi, lichen, moss, cya- 
nobacteria, and other microbes. Histori- 
cally, ecologists have paid little mind to 
biocrusts, which cover soils in arid, semi- 
arid, and extremely cold places all over 
the world. But researchers have realized 
these coatings produce and process nutri- 
ents that nearby organisms need to thrive, 
especially in arid environments. They also 
help a dryland soil retain its little moisture. 

Biocrusts form from communities of lichen, bacteria, and other microbes. 

In 2018, Weber, postdoc Emilio 
Rodriguez-Caballero, and their colleagues 
mapped all of Earth’s biocrusts, concluding 
they cover 12% of the land surface. They 
then teamed up with climate modelers 
and dust experts to figure out how much 
dust formation the biocrust glue currently 
averts. First, the researchers compiled data 
about how much wind it takes to destroy 
the biocrust and blow the soil 
away. They calculated dust emis- 
sions at 31 different sites, then fed 
the results into a model that pre- 
dicted worldwide dust emission— 
and how much it would rise with- 
out the biocrusts. “[They] used a 
very elegant combination of field 
data and modeling,” says Fernando 
Maestre, a dryland ecologist at the 
University of Alicante. 

Biocrusts reduce dust in the 
air by 700 million tons per year, 
Weber and her team report this 
week in Nature Geoscience. That 
amount would bury all of New 
York City under 35 centimeters of 
dust. The study “places biocrusts 
as key players in preventing dust 
emissions globally,” Maestre says. 
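
The New York City comparison can be checked with rough arithmetic. The sketch below assumes metric tons, a city land area of about 780 square kilometers, and a dust density of 2.6 grams per cubic centimeter (typical of mineral grains). None of these assumptions comes from the study as quoted here.

```python
DUST_TONS_PER_YEAR = 700e6    # from the article
KG_PER_TON = 1000.0           # assumes metric tons
NYC_LAND_AREA_KM2 = 780.0     # assumed land area of New York City
DUST_DENSITY_KG_M3 = 2600.0   # assumed density of mineral dust


def dust_depth_m(mass_tons: float, area_km2: float, density_kg_m3: float) -> float:
    """Depth of a uniform layer of the given mass spread over the area."""
    volume_m3 = mass_tons * KG_PER_TON / density_kg_m3
    area_m2 = area_km2 * 1e6
    return volume_m3 / area_m2


depth = dust_depth_m(DUST_TONS_PER_YEAR, NYC_LAND_AREA_KM2, DUST_DENSITY_KG_M3)
print(f"{depth * 100:.0f} cm")
```

With these inputs the layer comes out close to the 35 centimeters quoted. A loosely packed deposit would be less dense and therefore deeper.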
Over the next 65 years, between 25% 
and 40% of these crusts will vanish, says 
Rodriguez-Caballero, now at the University 
of Almeria. Climate change that threatens 
soil organisms will account for about half 
the destruction; other damage will come 
from trampling by humans, livestock, and 
farm machinery. 

Finger-Higgens and colleagues have 
documented the impact of climate. In a 
long-term biannual survey of plots in Utah’s 
Canyonlands National Park, they found that 
biocrust lichens in particular suffer when 
temperatures increase. As Canyonlands 
temperatures rose by 0.27°C per decade, 
lichens, especially those that help convert 
nitrogen in the air to a form that other or- 
ganisms can use, have almost disappeared, 
the team reported on 11 April in the Pro- 
ceedings of the National Academy of Sci- 
ences. With less nitrogen, fewer plants can 
survive, leaving ever more bare ground and 
more dust emission, Finger-Higgens says. 

Some implications of a dustier climate 
are unclear, researchers say. The impact of 
airborne dust on temperatures depends in 
part on the size of the particles. Dust par- 
ticles provide nuclei for cloud formation 
and can cause snow to melt faster. Although 
dust helps transport important nutrients for 
plant life, it can make respiratory problems 
and other health issues worse for people. 

Up to now, dusty places like the Sahel 
were expected to get greener and less dusty 
as higher carbon dioxide levels have a fer- 
tilizing effect, but biocrust loss will likely 
counteract this process to some degree, 
Weber and her colleagues note. “Biocrust, 
dust, [and] climate all exert feedback on 
each other,” says Diana Francis, an atmo- 
spheric scientist at Khalifa University who 
is not part of the work. 

Climate modelers have often overlooked 
how dust affects temperature and rainfall, 
says Michael Mann, an atmospheric scientist 
at Pennsylvania State University, University 
Park. The effects of diminishing biocrusts 
shouldn’t be dramatic enough to make much 
difference in global climate models, he says. 
But Joseph Prospero, an atmospheric chem- 
ist at the University of Miami, cautions that 
“there are large areas of the Earth for which 
we have essentially no information” about 
biocrusts. That may change. Weber, Maestre, 
Finger-Higgens, and other biocrust experts 
are seeking funding for standardized mea- 
surements of biocrusts around the world. 

But researchers say the need to protect 
these fragile communities is already clear. 
Reducing emissions and changing farming 
and other land use practices can help reverse 
their decline, Maestre says. “The findings 
provide strong arguments for preserving bio- 
crust communities worldwide.” 
U.S. RESEARCH POLICY 


NIH rolls out stricter disclosure 
rules for sexual harassment 


Institutions now legally required to inform agency if 
grantees are “disciplined” for workplace misbehavior 


By Jocelyn Kaiser 


The National Institutes of Health an- 
nounced last week that—at the di- 
rection of Congress—it is tightening 
rules for reporting sexual and work- 
place harassment by NIH-funded in- 
vestigators. Institutions will now be 
required to tell NIH if a grantee has been 
disciplined because of harassment findings. 

NIH calls the policy a “major step” to 
close loopholes that have allowed institu- 
tions to hide harassment cases. “There’s 
no question that we’re going to be hear- 
ing about cases that we either would not 
have heard about at all, or only after a sub- 
stantial delay,” says NIH Deputy Director 
for Extramural Research Michael Lauer. 

Observers welcome the law, but they 
note that NIH’s reporting re- 
quirements still fall short of 
those at the National Science 
Foundation (NSF). That could 
leave NIH unaware of some 
ongoing investigations. 

In recent years, NIH has 
beefed up its sexual harass- 
ment rules. In 2020, it asked 
institutions to tell the agency if 
a requested change in a grant’s principal 
investigator (PI), other key personnel, or 
institutional manager was related to ha- 
rassment concerns. In the past 4 years, in 
response to 112 harassment findings, NIH 
has worked with institutions to remove 
92 individuals from grants. 

But that reporting rule lacked teeth, NIH 
says. Institutions were told they “should” re- 
port harassment, but NIH couldn’t require 
disclosure, NIH acting Director Lawrence 
Tabak explained in a statement. “This limited 
NIH’s awareness” of harassment and its “abil- 
ity to take necessary action,” Tabak wrote. 

Now, thanks to a provision Congress 
added to NIH’s 2022 budget, disclosure 
will be mandatory. As of 9 July, institutions 
must inform NIH within 30 days if key 
grant personnel “are removed from their 
position or are otherwise disciplined due 
to concerns about harassment, bullying, 
retaliation, or hostile working conditions,” 
the agency announced on 10 May. 


Institutions must use an online form 
to detail the case and how it responded. 
NIH can then take additional steps, such 
as replacing personnel. If institutions 
ignore the rule, NIH can even suspend fur- 
ther awards. 

NIH’s new policy still falls short of NSF’s 
because the foundation requires reporting 
of any “administrative action” related to a 
harassment finding or probe. That could 
include nondisciplinary actions, such as 
barring a PI from teaching while an inves- 
tigation is underway. 

Lauer, however, says he believes NIH’s 
policy will not allow misbehavior to be 
swept under the rug. “When the con- 
cerns are so serious that a person is put 
on leave, then we do know about that,” 
he says. Allegations result in formal find- 
ings just 25% to 30% of the 
time, he adds. In “a number 
of cases,” investigators find 
“no wrongdoing.” 

NIH also hears about in- 
vestigations from victims 
and others who contact the 
agency, notes Heather Pierce, 
a senior policy specialist at 
the Association of American 
Medical Colleges. Pierce says although the 
phrase “otherwise disciplined” needs clari- 
fication, overall the new reporting policy 
“is a good, helpful, and clear step.” 

Some observers, however, want NIH to 
go further by requiring institutions to dis- 
close when a harassment investigation be- 
gins, before any determination of guilt—as 
federal rules now require for research mis- 
conduct. A 2019 report from an NIH work- 
ing group recommended that the agency 
follow a “parallel process.” 

Virologist Angela Rasmussen of the Uni- 
versity of Saskatchewan, who served on 
that committee, notes that institutions are 
sometimes reluctant to produce findings or 
take disciplinary action. She worries that 
by requiring disclosure only when there is 
a disciplinary action, the new policy might 
encourage administrators to “shield those 
who have committed misconduct.” For that 
reason, she says, “It’s important they [NIH 
officials] know about investigations.” 
Forests are expanding in 
many cooler regions, replacing 
snowy landscapes that 
reflect solar radiation with 
darker canopies that absorb it. 


THE FOREST FORECAST 


Climate change could lead to a net expansion of global forests. 
But will a more forested world actually be cooler? 


These are strange times for the In- 
digenous Nenets reindeer herders 
of northern Siberia. In their lands 
on the shores of the Arctic Ocean, 
bare tundra is thawing, bushes are 
sprouting, and willows that a gen- 
eration ago struggled to reach knee 
height now grow 3 meters tall, hid- 
ing the reindeer. Surveys show the 
Nenets autonomous district, an area the size 
of Florida, now has four times as many trees 
as official inventories recorded in the 1980s. 

In some places the trees are advanc- 
ing along a wide front, but 
in other places the gains are 
patchier, says forest ecologist 
Dmitry Schepaschenko of the In- 
ternational Institute for Applied 
Systems Analysis in Austria, who 
has mapped the greening of the Si- 
berian tundra. “A few trees appear 
here and there, and some shrublike 
trees become higher.” 

All around the Arctic Circle, trees 
are invading as the climate warms. 
In Norway, birch and pine are 
marching poleward, eclipsing the 
tundra. In Alaska, spruce are taking 
over from moss and lichen. Glob- 
ally, recent research indicates for- 
ests are expanding along two-thirds 
of Earth’s 12,000-kilometer-long 
northern tree line—the point where 
forests give way to tundra—while 
receding along just 1% (see map, p. 790). 

Forest gains are not confined to the far 
north. At lower latitudes, some warmer, arid 
regions are also seeing an uptick in trees, 
in part because increasing concentrations 
of carbon dioxide (CO₂)—the main planet- 
warming gas—are enabling plants to use wa- 
ter more efficiently and thrive in drier soils. 
And the fertilizing effects of CO₂ are enabling 
existing forests to add more leaves and wood, 
increasing their biomass. 

It’s a startlingly different picture from 
what is unfolding in the tropics, where hun- 
dreds of thousands of hectares of forest are 
lost each year to chainsaws and fire, and 
climate change is stressing the remaining 
trees. But those tropical losses could be more 
than offset by gains elsewhere, some studies 
predict, leading in the coming decades to a 
world with more and faster growing trees. 

By Fred Pearce 

That might seem like surprisingly good 
news for curbing global warming. Forests 
often have a cooling effect, releasing organic 
compounds and water vapor that promote 
the formation of clouds. And more, faster 
growing trees would absorb more atmo- 
spheric carbon and lock it away in wood. 

But the calculus of forests’ climate effects 
is far from straightforward, and emerging re- 
search suggests a more forested world won’t 
necessarily be a cooler world. New forests 
could enhance warming in some areas, for 
example, by reducing the amount of sunlight 
that is reflected into space. Over time, that 
could offset any gains in carbon absorption. 

Wildfire can destroy forests, but in some it can clear the way for faster 
growing trees that help curb climate change by absorbing carbon. 

“Forests are not just carbon sponges,” says 
environmental scientist Deborah Lawrence 
of the University of Virginia. But that com- 
plexity, she adds, “is not adequately captured 
by current carbon-centric metrics.” 


TO ACCOUNT for how forests will affect fu- 
ture climate, researchers must not only tally 
current trends, such as development-driven 
deforestation, but also forecast how pow- 
erful forces such as surges in wildfire and 
warmer temperatures might affect forests, 
sometimes helping and sometimes harming 
their ability to soak up atmospheric carbon. 


Historically, researchers have focused 
much of their attention on the losses side of 
the balance sheet, for example by quantifying 
the steady erosion of tropical forests, one of 
the planet’s major carbon sinks. In the Ama- 
zon, the world’s most expansive tropical for- 
est, the news has been almost unremittingly 
bad. Overall, it has shrunk by about 18% since 
the 1970s because of deforestation. 

In 2007, meteorologist Carlos Nobre of 
Brazil’s National Institute for Space Research 
(INPE) warned the ongoing losses could flip 
the Amazon from acting as a global carbon 
sink to a significant new source. 
Simulations of the Amazon’s hydro- 
logical cycle, he found, showed de- 
forestation would make rainforests 
drier, reduce tree growth, and pro- 
mote tree losses, resulting in a net 
release of carbon to the atmosphere. 

That prediction now appears 
to have been realized, says INPE 
climate researcher Luciana Gatti. 
Drawing on measurements of at- 
mospheric carbon collected dur- 
ing 590 research flights over the 
Amazon between 2010 and 2018, 
she reported in a July 2021 Na- 
ture study that the southeastern 
Amazon—a region often called the 
“arc of deforestation,” where agri- 
culture has gobbled vast swaths 
of trees—had flipped from sink to 
source. So, in consequence, had 
the Amazon as a whole. “We have hit a tip- 
ping point,” she says. 

The years since 2018 have been “even 
worse” for the Amazon’s carbon storage ca- 
pacity, Gatti says, as warming temperatures 
have compounded the effects of deforesta- 
tion. Longer dry seasons are stressing trees 
and increasing fire risks, accelerating the 
conversion of forest to more open savannas. 
Overall, the Amazon's total carbon storage 
could drop by one-third in coming decades if 
regional temperatures rise by 4°C, modeling 
studies conducted by climate scientist Chris 
Jones and colleagues at the United King- 
dom’s Met Office conclude. 

In the meantime, however, some tropi- 
cal forests are continuing to sequester large 
amounts of carbon. For example, one long- 
term field study in the lowland forests on the 
island of Borneo recently reported that intact 
1-hectare plots, where tree deaths remain 
infrequent, hold an average of 20 tons more 
carbon today than they did in 1958, primarily 
because of CO₂ fertilization. 

But ongoing warming is working against 
tropical forests, even those that are still 
intact. An international study that has 
tracked 300,000 trees in more than 500 
plots of intact tropical forests over 30 years 
finds that even without deforestation, their 
ability to capture CO₂ peaked in the 1990s 
and has since declined by one-third. The 
decline began in the Amazon, and since 
2010 has extended to tropical Africa, says 
co-author Simon Lewis, a plant ecologist at 
University College London. Remote-sensing 
techniques that assess changes in the total 
leaf area produced by trees and other plants 
also suggest many tropical forests are slow- 
ing their carbon intake. 


THIS BLEAK PICTURE brightens outside the 
tropics. In cooler regions, research suggests 
climate trends are driving gains in both for- 
est extent and productivity that could more 
than compensate for losses in the tropics— 
“presuming,” Lawrence says, “that the world 
can meet its goals for limiting deforestation.” 
(So far, it’s not clear it will.) 

One source of optimism is studies show- 
ing higher CO₂ levels are already helping 
forests add biomass. For example, a widely 
cited 2016 study headed by remote sensing 
researcher Zhu Zaichun of Peking University 
found that between one-quarter and one-half 
of the planet’s vegetated places showed an 
increase in leaf area since 1982, whereas just 
4% showed a decline. Simulations by Zhu’s 
team suggest CO₂ fertilization accounts for 
70% of the increase in global forest biomass. 

In the future, more CO₂ will also prompt 
forests to expand into new areas, other 
planet-scale simulations indicate. These 
digital models allow researchers to explore 
how forests might respond to a wide array of 
factors, including changes in global temper- 
atures and atmospheric carbon concentra- 
tions. One such study, published in December 
2021 in JGR Atmospheres by climatologist 
Jennifer Kowalczyk of Brown University, 
found warming alone caused vegetation to 
decrease globally, with tropical losses from 
overheating exceeding nontropical gains 
from longer growing seasons. But that find- 
ing flipped when she added the fertilizing ef- 
fects of increased atmospheric CO₂. Overall, 
boosting CO₂ levels to about 560 parts per 
million—or double preindustrial levels— 
increased global forest cover by 15% above 
preindustrial extent. 

Much of the simulated increase occurred 
in boreal forests of the north, where longer 
growing seasons and thawing permafrost 
help trees gain ground. But forests also 
expanded in arid continental interiors in 
the subtropics. 

That was somewhat surprising, Kowalczyk 
says, because the greening in arid zones “oc- 
curs even without significant increases in 
precipitation.” Instead, she says, the extra at- 
mospheric CO₂ allows trees to reduce water 
loss, because they don’t need to open their 
stomata so wide to take in CO₂. That enables 
seedlings to take root where none grow today. 


SOME RESEARCHERS question optimistic 
predictions of future forest expansion. One 
issue, they say, is that other factors could 
intervene. Deforestation, for example, could 
accelerate to satisfy growing global demand 
for food and resources, wiping out any 
global gains. And a shortage of key soil nu- 
trients such as phosphorus could neutralize 
CO₂ fertilization, especially in tropical for- 
ests, says Chris Huntingford of the UK Cen- 
tre for Ecology & Hydrology. In the Amazon, 
for example, a 2019 study published in Na- 
ture Geoscience, led by ecosystem modeler 
Katrin Fleischer of the Technical University 
of Munich, found that a lack of adequate 
phosphorus could cut forest gains from CO₂ 
fertilization in half. 

Greening trend 

Around the Arctic, trees and shrubs are 
encroaching on tundra thanks to warmer 
temperatures and longer growing seasons. 
From 2000 to 2020, increases in tundra 
greening (top) outstripped vegetation decreases 
(bottom), according to a recent analysis of satellite data. 

Another big question is how a warmer, 
drier climate will influence wildfires. Model- 
ing studies have long warned that climate 
change will increase fire risks in tropi- 
cal and temperate forests. Boreal 
forests could also see losses. In an 
early portent, Global Forest Watch 
reported earlier this year that bo- 
real forests lost more than 8 million 
hectares in 2021, 30% more than in 
2020, with wildfires mostly to blame. 

But fire could also enable some boreal 
forests to store more carbon, not less, plant 
ecologists say. That’s because regenerating 
forests can produce stands that are denser 
or comprise more vigorous species better 
adapted to fire. 

Michelle Mack, a forest ecologist at North- 
ern Arizona University, has seen this arbo- 
real phoenix in action. After fires devastated 
evergreen spruce forests in Alaska in 2004, 
their charred remains were replaced by 
faster growing and less flammable aspen and 
birch—deciduous trees that could ultimately 
store up to five times more carbon than their 
evergreen predecessors. “I thought there was 
no way these forests could recover the carbon 
they lost in the fire,” Mack says. “But these
deciduous trees did so rapidly.” 

This phenomenon appears to be wide- 
spread across western North America 
and in the Russian Far East, she says. 
Schepaschenko concurs. In Siberia, he says, 
fires have helped fuel the northward spread 
of forests into the tundra. “The flames re- 
move moss and lichen cover, allowing [tree] 
seeds to reach mineral soil.” 


EVEN IF models suggesting a more forested 
future are right, however, it’s not yet clear 
just how beneficial those trees might be for 
curbing global warming. 

On the plus side, there’s little doubt 
that forests can help cool the lower atmo- 
sphere. One way they do this is by mov- 
ing large amounts of moisture from soils 
into the air. A typical tree may “sweat” up 
to 100 liters of water every day, and the 
planet’s estimated 3 trillion trees release 
some 60,000 cubic kilometers each year, 
the equivalent of flooding the entire U.S.
land area with about 6 meters of water. 
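
The figures in the preceding paragraph can be cross-checked with a little arithmetic. The sketch below is only a back-of-envelope illustration: the tree count and annual water volume come from the text, whereas the U.S. land area (about 9.1 million square kilometers) is an assumed round value, and the function names are made up for this example. Under that assumption the equivalent depth comes out between 6 and 7 meters, and the implied average per tree is well under the 100 liters a day quoted as an upper bound.

```python
# Back-of-envelope check of the transpiration figures quoted in the text.
TREES = 3e12                 # estimated number of trees on the planet (from the text)
ANNUAL_RELEASE_KM3 = 60_000  # water released per year, in cubic kilometers (from the text)
US_LAND_AREA_KM2 = 9.1e6     # ASSUMPTION: approximate U.S. land area, in square kilometers

LITERS_PER_KM3 = 1e12        # 1 km^3 = 1e9 m^3, and 1 m^3 = 1,000 liters


def flood_depth_m(volume_km3: float, area_km2: float) -> float:
    """Depth in meters if volume_km3 were spread evenly over area_km2."""
    depth_km = volume_km3 / area_km2
    return depth_km * 1000.0


def liters_per_tree_per_day(volume_km3: float, trees: float, days: int = 365) -> float:
    """Average daily water release per tree implied by the global total."""
    return volume_km3 * LITERS_PER_KM3 / trees / days


depth = flood_depth_m(ANNUAL_RELEASE_KM3, US_LAND_AREA_KM2)
per_tree = liters_per_tree_per_day(ANNUAL_RELEASE_KM3, TREES)

print(f"Equivalent depth over the assumed U.S. land area: {depth:.1f} m")
print(f"Implied average release per tree: {per_tree:.0f} liters per day")
```

The exact depth depends on which area figure is used, so the result should be read as consistent with "about 6 meters" rather than as a precise value.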

This transpiration cools the air, as energy 
is required to convert the liquid water to va- 
por. And the released vapor, together with 
other organic compounds produced by trees, 
helps create clouds that can bring down
temperatures. (CO₂ fertilization could reduce
transpiration by allowing trees to use water 
more efficiently, but researchers say it will re- 
main a potent cooling force.) 

The relative roughness of a forest canopy 
also helps reduce temperatures. The leaves 
and branches cause air currents to swirl 
and mix, helping dissipate heat higher into 
the atmosphere. 

Together, these two processes currently 
help cool Earth’s surface by 0.4°C to 0.6°C, 
Lawrence says, with each contributing about 
half of the reduction. 

But it turns out forests can warm the 
planet, too, primarily by changing the reflec- 
tivity, or albedo, of land surfaces. Gleaming 
surfaces such as fresh snow have an albedo 
of 0.8 to 0.9 (on a scale from zero to one), 
meaning they bounce a lot of solar energy 
back into space. In contrast, a continuous 
canopy of broadleaf trees can have an albedo 
of just 0.15, meaning the trees absorb solar 
energy and radiate it in the form of heat. A 
canopy of conifers can have an even lower 
albedo: 0.08. 
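
Because albedo is the reflected fraction of incoming sunlight, the absorbed fraction is simply one minus the albedo. The sketch below applies that definition to the values quoted above; it is a minimal illustration, not a climate model, and it ignores snow resting on the canopy, sun angle, clouds, and seasonal change. The dictionary labels and function names are invented for the example.

```python
# Absorbed share of incoming sunlight for the surfaces mentioned in the text.
# Albedo values are taken from the article; everything else is illustrative.
ALBEDO = {
    "fresh snow (low end)": 0.80,
    "fresh snow (high end)": 0.90,
    "broadleaf canopy": 0.15,
    "conifer canopy": 0.08,
}


def absorbed_fraction(albedo: float) -> float:
    """Fraction of incoming solar energy absorbed by a surface."""
    if not 0.0 <= albedo <= 1.0:
        raise ValueError("albedo must lie between 0 and 1")
    return 1.0 - albedo


def extra_absorption(old_albedo: float, new_albedo: float) -> float:
    """Change in absorbed fraction when one surface replaces another."""
    return absorbed_fraction(new_albedo) - absorbed_fraction(old_albedo)


for surface, value in ALBEDO.items():
    print(f"{surface}: absorbs {absorbed_fraction(value):.0%} of incoming sunlight")

# Conifers replacing bright fresh snow: the share absorbed rises sharply.
gain = extra_absorption(ALBEDO["fresh snow (high end)"], ALBEDO["conifer canopy"])
print(f"Conifer canopy replacing fresh snow: {gain:+.0%} of incoming sunlight")
```

The large gap between snow and conifer values is why the albedo penalty matters most where snow cover is extensive and long-lasting.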

In boreal and high-altitude regions that 
get a lot of snow, the expansion of forests is 
expected to have a major impact on albedo, 
as dark canopies replace snow-covered sur- 
faces. In arid regions, the shift can also be 
dramatic, as trees shade highly reflective 
sandy or rocky soils. But whether the warm- 
ing caused by albedo changes will ultimately 
outstrip a forest’s cooling contribution is 
likely to depend on several factors, including 
latitude, altitude, how fast the trees grow, 
and the age of the forest. 

Overall, new forests are likely to have 
their greatest warming impact at high lati- 
tudes and high altitudes, where snow cover 
is extensive and long-lasting and cold tem- 
peratures mean trees grow more slowly, 
reducing their ability to capture carbon. In 
the contiguous United States, for example, a 
study by geographer Chris Williams of Clark 
University found that over a 100-year span, 
forests growing in the Rocky Mountains will 
cause net warming, whereas those growing 
in lower, less snowy areas east of the Missis- 
sippi River and on the Pacific Coast will cause 
net cooling. 

But the impact depends, in part, on when 
during the forest’s life cycle it is measured. 
A young forest, for example, might warm the 
atmosphere because of its albedo effect. But 
it could become net cooler as the trees age 
and store more carbon. 

In Israel, earth system scientist Dan Yakir
of the Weizmann Institute of Science has 
been watching this balancing act play out 
in the Yatir Forest, a 2-hour drive from Tel 
Aviv. Workers created the forest in the 1960s, 
planting some 4 million Aleppo pines in 
the yellow sands of the Negev Desert on the 
slopes of Mount Hebron. Today, the forest is 
often promoted as a valuable carbon sink. 

So far, however, any climate gains are not
clear-cut, Yakir says. His measurements of
biomass and albedo have shown that, so far,
the warming caused by the dark canopy of
the pines now exceeds the cooling from their
carbon capture. As the trees grow, however,
he expects the cooling influence to catch up.
But the crossover might not occur until the
2040s, he says, and that assumes the trees
live that long.

The findings were unexpected, Yakir says.
And he cautions that the Yatir forest is, in
some ways, unusual. For example, “The forest
is almost black and the desert almost white.
... Everything makes our case extreme.”


THE UNCERTAINTY surrounding how new
forests will affect climate poses not just a
scientific challenge: It has policy
implications, too. Few large-scale tree planting
projects, for example, assess the potential
climate downsides from a changing albedo.
As a result, such greening initiatives “could
really backfire” if they “end up placing trees
in locations that are counterproductive for
cooling the climate,” Williams says.

Then there is the question of how
governments should account for new forests when
they tally up their contributions to
complying with global climate agreements such as
the Paris accord. Typically, nations are given
credit for protecting or expanding their
forests. Russia, for example, calculates that
roughly one-quarter of its fossil fuel
emissions are offset by its huge, carbon-absorbing
forests. And Schepaschenko’s discovery that
Russia’s boreal forests are expanding and
storing even more carbon suggests the
nation could go much further. “We have the
potential to turn [new forests] into a massive
carbon capture hub,” the nation’s minister
of development of the Far East and Arctic,
Aleksey Chekunkov, told Bloomberg last year.

But what if new forests turn out to
accelerate warming over the long term, not slow it?
Should nations still get credit?

Such questions, scientists say, highlight
the importance of gaining an even more
sophisticated understanding of how
forests and Earth’s climate interact. Without
a clearer picture, Williams says, “I really
worry that we could be placing too much
emphasis on forests as a climate solution,
when what we really need is deep
decarbonization of society.”


Competing effects

Global tree cover could expand in a
warming climate, as gains in regions such
as the Arctic outstrip losses in others,
such as the tropics. But a more forested
world might not help curb global warming,
as forests create conditions that both
cool and warm the planet.

1 Lower albedo encourages warming

Warmer climates are enabling forests to expand in
cooler, snowy places, including the Arctic and high
mountains. But when dark forest canopies replace bright
snow, they reduce the reflectivity, or albedo, of the
surface, causing solar radiation to be absorbed instead
of bouncing back into space.

2 Cooling from clouds and turbulence

Forests release water vapor and organic chemicals that
promote the formation of clouds, which contribute to
cooling. Tree canopies create rough surfaces that cause
air currents to swirl and help dissipate heat into the
lower atmosphere.

3 Carbon uptake slows warming

Rising carbon dioxide levels in the atmosphere are
helping fertilize tree growth. The added biomass
(leaves and wood) can lock away more carbon and curb
warming. But carbon uptake in some forests
is slowing as global temperatures rise.


Fred Pearce is a journalist based in London. 

Managing forests for competing goals


Tree plantations face difficult trade-offs between production and ecological goals 


By Jessica Gurevitch


Trees provide beauty, shade, food, and
lumber. Forests of trees are reservoirs 
of biodiversity, sustaining a web of life 
while sequestering carbon, modulat- 
ing regional climates, conserving soil, 
retaining water from rainfall, and 
moderating runoff. Concerns about forest 
losses and climate change have motivated 
forest replanting to support ecosystem
services, such as carbon storage, soil
conservation, recapture of water from rainfall, and
lumber production (1). So, how well can tree 
plantations achieve the different goals they 
are billed as providing compared with native 
forests? On pages 839 and 865 of this issue, 
Hua et al. (2) and Feng et al. (3), respectively, 
provide insight into the answers to this prob- 
lem. Hua et al. examine whether plantations 
purposed for lumber production can also 
provide a refuge for biodiversity and achieve 
other ecological goals, and Feng et al.
consider whether productivity in such
plantations can benefit from multispecies designs
rather than the typical monoculture stands.

There has been long-standing uncer- 
tainty about how the human use of trees 
compromises the value of forests for bio- 
diversity conservation and ecosystem ser- 
vices. Can a scenario exist in which one can 
plant trees profitably for lumber produc- 
tion while simultaneously promoting biodi- 
versity and supporting ecosystem services? 
Reforestation often replaces previous na- 
tive forests with non-native, low-diversity 
tree plantations (4). Hua et al. compared 
plantations with native and restored forests 
to quantify the trade-offs between wood
production and biodiversity and the ecosystem
services of carbon storage, soil
conservation, and water provisioning. They paired
site data on plantations and native forests 
using 25,950 matched data pairs from 53 
countries, encompassing presence and 
abundance data on a wide range of organ- 
isms, from bats and terrestrial mammals to 
reptiles and amphibians, birds, arthropods 
and other invertebrates, and many different 
growth forms of plants. 

Unfortunately, according to their
analyses, achieving the competing goals of
forest production and support for biodiversity
and ecosystem services appears to be
elusive. Hua et al. report that tree plantations
produced more than twice the lumber
produced by native forests and secondary
forests. Plantations of single species of trees,
known as monocultures, had far less
biodiversity of animals and plants, much poorer
erosion control, worse water yield, and
lower biomass and carbon storage. Globally,
tree plantations often all use the same small
number of tree species, sometimes growing
millions of genetically identical clonal
individuals. The data of Hua et al. reveal that
monoculture and low-diversity tree
plantations may be good at providing lumber, but
they do not achieve goals for biodiversity
and ecosystem services. There is a marked
trade-off between those goals and profitable
lumber production: low-diversity forest
plantations do not preserve biodiversity or
provide carbon storage sufficient to combat
climate change effectively.

Single-species tree farms, such as this Eucalyptus
plantation in São Paulo, Brazil, negatively affect
biodiversity and the ecosystem benefits provided
by native forests (seen in the top left).

And yet, there may be a way out, if only 
partially. Feng et al. synthesized a dataset 
from several hundred publications across 
six continents to determine whether multi- 
species planting had advantages over mono- 
cultures in productivity. They found that 
growth and productivity were substantially 
greater when multiple tree species were 
grown together compared with monocul- 
tures, echoing findings in natural systems of 
positive relationships between productivity 
and plant biodiversity. They attribute their 
results to niche complementarity when dif- 
ferent plant forms—such as deciduous and 
needle-leafed trees—are grown together. 
Collectively, the studies by Hua et al. and 
Feng et al. suggest that more-complex, mul- 
tispecies plantations may offer the promise 
of higher productivity while at least in part 
supporting biodiversity and providing eco- 
system services. Replacing native forests 
with plantations, even multispecies planta- 
tions, cannot fully restore biodiversity, but 
well-designed forest restorations may sup- 
port important ecological and conservation 
goals while providing economic value. 

Lumber plantations have replaced natu- 
ral forests at an almost unimaginable scale 
worldwide. Almost 14 billion trees have 
been planted in 74 countries since 1961 (4). 
The most commonly planted species are the 
most commercially profitable—cacao, teak, 
moringa, mango, and coffee. Although, in 
total, hundreds of tree species are planted 
in commercial plantations, this pales in 
comparison to the tens of thousands of na- 
tive tree species—more than 15,000 of which 
face extinction (5). Plantations of single 
Eucalyptus species native to Australia in 
Africa, Asia, and South America for timber, 
paper, fuel, and sometimes carbon storage 
can decimate native biodiversity. The same 
is true for the extensive plantations of pines 
native to North America in Australia, New
Zealand, and parts of South America and
South Africa. The non-native pines disperse 
from plantations into native forest commu- 
nities and can become dominant, displacing 
native species. The North American Douglas 
fir is planted over large expanses in the 
Netherlands and other parts of Europe. Non- 
native Acacia and Syzygium species have 
been widely planted in continental Africa 
and Madagascar, where they can be highly 
invasive and use excessive amounts of soil 
water to the disadvantage of native species. 

Looking beyond lumber plantations, the 
picture for forest losses may be even bleaker, 
from the vast plantations of oil palms that 
have replaced high-diversity tropical forests 
in Southeast Asia to the deforestation and 
fragmentation of native forests for cattle 
pasture. These replacements of native for- 
ests pose grave threats to biodiversity and 
ecosystem services (6, 7). 

Loss of the complexity of tree species 
diversity in native forests threatens other 
species that live in and depend on those 
forests, from birds and invertebrates to fish, 
mammals, and even humans. Other analy- 
ses are in agreement with the general find- 
ings that economic productivity trades off 
against biodiversity preservation and eco- 
system services (6, 7), although perhaps the 
works by Feng et al. and Hua et al., taken to- 
gether, suggest that there are strategies that 
might better achieve both economic and 
ecological goals. The growth in both com- 
mercial and nonprofit organizations plant- 
ing trees has been accelerating worldwide 
(4). Although planting trees sounds like a 
good thing, whether the goals are providing 
livelihoods for local communities, profit for 
corporations, or carbon offsets, the results 
may not be as positive as anticipated. How 
well those efforts will effectively combat cli- 
mate change, biodiversity loss, and the loss 
of other ecosystem services remains to be 
seen, but the prognosis—at least for mono- 
cultures and low-diversity plantations—is 
not encouraging.

Mitochondrial complex complexification


Variation in complex composition provides clues about the function of individual subunits 


By Martijn A. Huynen and Dei M. Elurbe 


Life’s universal energy currency,
adenosine triphosphate (ATP), is mainly
produced in mitochondria by a series 
of distinct protein complexes that 
catalyze oxidative phosphorylation 
(OXPHOS). These complexes also 
exist in α-proteobacteria, the ancestors of
mitochondria, but somehow their size and 
subunit count (complexity) has increased 
substantially since the endosymbiosis of a 
bacterium with a host cell at the origin of 
the eukaryotes. For example,
complex I, which catalyzes
the first step of OXPHOS, has
increased from 17 subunits
in the bacterial ancestors of
mitochondria to 45 subunits
in mammals (1). Why do the
eukaryotic complexes contain
these extra, supernumerary
subunits when their primary
function can be performed in
bacteria with fewer subunits? On
page 831 of this issue, Zhou et al.
(2) report the structures of the
OXPHOS complexes I, III, and
IV in the single-celled eukaryote
Tetrahymena thermophila.
Comparing these structures
with previously known ones
suggests new hypotheses about
the adaptive value of supernumerary
subunits.

T. thermophila has all five
OXPHOS complexes; is evolutionarily
very distant from model eukaryotic species
for which there are already
OXPHOS structures (see the
figure); and has inner mitochondrial
membranes in which these structures reside that
are quite different from the membranes in
those model species, making it a relevant
model species itself. Zhou et al. show that
complexes I and IV in T. thermophila stand
out for three reasons: They have many more extra
subunits than have been observed in other
eukaryotes, most of these subunits
were added later in evolution than in
other eukaryotes, and some of the
supernumerary subunits are encoded in
the mitochondrial DNA (mtDNA) rather
than the nuclear genome. These increases
in complexity are consistent with what has
been observed for complex V in this
species (3), as well as in other species in
the alveolate branch of the eukaryotic tree,
such as the malaria parasite Plasmodium
falciparum and the toxoplasmosis parasite
Toxoplasma gondii (4-6).

One hypothesis about the function of the
extra subunits is that they do not adapt the
function of the complex itself, but instead
allow the transfer of OXPHOS genes from
the mtDNA to the nuclear genome, where,
at least in animals, the mutation rate is
lower than in the mitochondrion. The most
concrete evidence for this exists for
complex V of T. gondii (4), in which subunit-a
that is encoded on mtDNA in other species
is encoded in the nuclear genome. Parts
of the gene encoding subunit-a that have
been lost in the process of nuclear genome
transfer have been replaced by the addition
of subunits in the complex. However, in
T. thermophila, the mtDNA is relatively rich
in genes and encodes three of the
supernumerary OXPHOS subunits, arguing against
such a compensatory mitonuclear evolution
process in this species.

Evolving supernumerary subunits
The evolution of the number of subunits (shown in parentheses)
in oxidative phosphorylation complexes I and IV in three branches of eukaryotes
from the last eukaryotic common ancestor (LECA) is shown. Data are
based on protein structures (1, 2, 10), results from complexome profiling (5),
and evolutionary reconstructions (2, 5).

Complex I
Mammals (45), Plants (48), T. thermophila (64)
Opisthokonta (42), Alveolates (46)
LECA (46)
α-proteobacteria (17)

Complex IV
Mammals (14), Plants (12), T. thermophila (52)
Opisthokonta (13), Alveolates (11)
LECA (12)
α-proteobacteria (3)

A second hypothesis for complexification
is that the extra subunits serve to assemble
the large protein complexes into the inner
mitochondrial membranes. Evidence
supporting this idea is that human complex I
fails to assemble in the absence of the
additional subunits (7). The T. thermophila
structure of complex IV reported by Zhou
et al. provides more specific
support for this hypothesis
because it includes subunits
that serve as assembly factors
in other species, where they
only temporarily associate with
complex IV. These include proteins
that are homologous to
the proteins surfeit locus protein 1
(SURF1) and cytochrome
c oxidase assembly protein 20
(COX20) that function in the
assembly of complex IV in humans
and yeast. A particularly revealing
example of this incorporation
of assembly factors into
the complex involves proteins that
insert so-called mitochondrial
carrier proteins into the mitochondrial
membrane. Zhou et
al. show that in T. thermophila,
three mitochondrial carrier
proteins have become part of
complex IV, as well as six proteins
that are involved in their
insertion into the membrane.
It is as if those former subunits
took along their own assembly
factors when they became part
of complex IV. Nevertheless,
although there are now specific
hypotheses for the function of these
extra subunits, the adaptive value of
making them permanent members of the
complexes, instead of only using them during
complex assembly, is unknown. After all,
such assembly factor subunits can now only
be used once per complex in T. thermophila.

However, there is one class of additional
subunits for which an adaptive function
is apparent from the structural analysis of
Zhou et al. In mammals, fungi, and plants,
the inner mitochondrial membranes are
relatively flat, disk-like structures, but in
T. thermophila, they form tubular structures.
The structure of the supercomplex that
contains complex I and complex III appears
curved, fitting well on the tubular
membranes of mitochondria in T. thermophila
and potentially causing their curvature.
Two of the supernumerary subunits appear
to form a wedge between complexes I and
III, such that the parts of the
supercomplex within the membrane follow its curve.
Bending the mitochondrial membrane is a
common theme in complex V of OXPHOS
(8), including in T. thermophila (3) and in
T. gondii (4), but the findings of Zhou et al.
now also reveal a curved shape for
complexes I and III. Nevertheless, what remains
puzzling is the large variation in the
composition of the OXPHOS complexes between
T. thermophila on the one hand and
P. falciparum and T. gondii on the other,
because their inner mitochondrial
membranes have all been classified as
tubular (9). The variations between these
membranes are, however, quite large, because
the T. gondii membranes have knob-like
structures that have not been observed in
T. thermophila and that are caused by a
distinct subunit of T. gondii complex V (3).

The view of mitochondrial OXPHOS
complex evolution has, until recently,
mainly been determined by that of the
fungi, animals, and plants (which
together represent only two of the six main
branches in the eukaryotic tree of life),
with relatively similar inner mitochondrial
membranes and complex compositions.
Elucidation of the composition and
structure of these complexes in other branches
of the tree of life, as in Zhou et al.,
including those with different inner
mitochondrial membrane shapes (9), variation in
the shape of mitochondrial membranes
during their life cycle (5), and varying
mtDNA gene content, as well as elucidating
covariation of these proteins with membrane
shape and mtDNA gene content, will provide
more specific hypotheses to experimentally
test and explain the enigmatic
complexification of mitochondrial OXPHOS complexes.


REFERENCES AND NOTES 


1. K. Fiedorczuk et al., Nature 538, 406 (2016).
2. L. Zhou et al., Science 376, 831 (2022).
3. R. K. Flygaard, A. Mühleip, V. Tobiasson, A. Amunts, Nat. Commun. 11, 5342 (2020).
4. A. Mühleip et al., Nat. Commun. 12, 120 (2021).
5. F. Evers et al., Nat. Commun. 12, 3820 (2021).
6. A. E. Maclean et al., PLOS Pathog. 17, e1009301 (2021).
7. D. A. Stroud et al., Nature 538, 123 (2016).
8. W. Kühlbrandt, Annu. Rev. Biochem. 88, 515 (2019).
9. T. Pánek et al., Curr. Biol. 30, R575 (2020).
10. V. Zickermann et al., Science 347, 44 (2015).

ACKNOWLEDGMENTS 
M.A.H. thanks F. Evers for helpful comments. M.A.H. is sup- 
ported by ZonMw TOP grant no. 91217009. 


10.1126/science.abq0368 


SCIENCE science.org 


Imprints of fossil coccoliths discovered in black shales reveal that calcifying microalgae remained prominent 
producers during previous extreme ocean conditions. 


Fossil imprints from oceans of the past


Marine phytoplankton show notable resilience during extreme ocean states


By Jorijntje Henderiks 


Throughout the long opera of Earth’s
oceans, algal blooms have dotted 
across the seascape like a cacophony 
of pop-up solos. Phytoplankton—the 
microalgae at the bottom of marine 
food webs—compete with each other 
for sunlight and nutrients to sustain pho- 
tosynthesis and turn inorganic carbon into 
organic matter. The occasional blooms of 
these microalgae, often at scales visible 
from space, are followed by a chain of life 
and decay, as different species begin and 
end their life cycles, following each other 
in close succession. Phytoplankton play a 
pivotal role in global biogeochemical cycles 
and feedbacks on climate and ocean chem- 
istry. On page 853 of this issue, Slater et al. 
(1) provide evidence that coccolithophores,
a group of calcifying haptophyte algae, re- 
mained prominent producers throughout 
prolonged periods of ocean warming and 
oxygen depletion in the past, challenging 
the assumption (2) that they had suffered 
“calcification crises” under previous ex- 
treme ocean conditions. 

Although they are far from fully understood,
Jurassic and Cretaceous oceanic anoxic
events (OAEs) are known to have lasted for
a few hundred thousand years and are all
linked to global warming, increased
atmospheric carbon dioxide (CO₂), and
perturbations of the carbon cycle at the time (3).
Increased marine production, fueled by 
higher nutrient supply to the oceans known 
as eutrophication, was a key control (4) in 
maintaining poorly oxygenated waters and 
enhanced burial of organic matter. The 
decay of excess organic matter consumed 
nearly all dissolved oxygen in the deep 
ocean, whereas the resulting anoxic
conditions at the seafloor promoted the recycling
of phosphate, an important nutrient. Such 
reinforcing feedback loops kept primary 
production at an increased level for a long 
time, before the increased burial of organic 
matter into the sediments was able to
balance and, eventually, decrease excess
atmospheric CO₂.

The black shales—dark, organic-rich 
shale deposits—that first drew attention to 
the OAEs and their abundant fish fossils 
(5) have now given up yet another of their 
long-kept secrets. As confirmed by Slater 
et al., black shales have a low carbonate 
content and thus render very low numbers 
of fossil coccoliths, which are the calcite 
scales of coccolithophores. However, the 
authors prepared their rock samples with 
another purpose in mind—namely, to in- 
vestigate different types of organic mat- 
ter, including pollen and organic-walled 
plankton, which requires dissolving rocks 
in acids. Because calcite readily dissolves, 
researchers usually steer clear of acids
when studying coccoliths, which are quan- 
tified from untreated rocks. Herein lies a 
beautiful example of scientific serendipity: 
Instead of the coccoliths themselves, the 
exquisitely illustrated results reveal their 
micrometer-scale imprints pressed into 
the surfaces of organic matter. 

It is no surprise that because of the de- 
structiveness of this method, this mode 
of coccolith preservation has been largely 
hidden from researchers. Slater et al. have 
shown this to be a common feature in 
Mesozoic black shales, including carbon- 
ate-free samples, from widely separated 
locations and across OAEs of different 
ages. Most of the coccoliths that arrived 
at the seafloor were thus dissolved after 
they were buried within organic-rich sedi- 
ments, which served as a mold for their re- 
maining imprints. The findings of Slater et 
al. contradict the assumptions that lower 
numbers of coccolith fossils across OAEs 
reflect a primary signal of decreased coc- 
colithophore calcite export production and 
what some have called “abundance crises” 
in the photic zone (2). The fossil imprints 
confirm undoubtedly that surface water 
conditions did not impair intracellular 
calcification by coccolithophores during 
OAEs, at least not during their blooms. 

The resilience of coccolithophores and 
other algae may be related to the peri- 
odicity of these short-lived blooms. Most 
microalgae have evolved successful ways 
to survive suboptimal or extreme condi- 
tions, usually by producing resting stages 
or switching between life phases with dis- 
tinct ecophysiological traits. Still, global 
algal biomass may have shifted from
coccolithophore-dominated, and therefore
chalk-forming, producers to other
noncalcifying producers. For example,
nitrogen-fixing cyanobacteria and green algae also
contributed to primary production during 
OAEs (2, 3), but the actual proportions are 
difficult to quantify, given the selective 
preservation biases of fossil groups and 
between different sediment types. 

The precise mechanisms and timing for 
the postburial coccolith disappearance act 
will need further exploration. For instance, 
the much younger organic-rich sediments 
called sapropels in the Mediterranean 
typically contain abundant coccoliths (6, 7) 
and may provide some clues. If treated to 
an acid bath, such sediments may or may 
not reveal similar imprints and be a test 
for the idea that a certain degree of over- 
burden pressure and lithification was re- 
quired for their formation. 

Even with its inherent preservation bi- 
ases and “noise-canceling” properties, the 
sedimentary record is still the only way 
to gauge the long-term consequences of 
climate change on marine ecosystems. 
Organisms higher up the food web, such 
as the nekton and benthos, saw their habi- 
tats substantially diminished during OAEs 
(2, 3) and sapropel formation (6) for thou- 
sands of years, which may be related to 
phytoplankton biomass resilience and sta- 
bility (8). The Baltic Sea, a modern analog 
for marine anoxia, is experiencing severe 
eutrophication (9), creating conditions 
that support large cyanobacteria blooms in 
the surface waters but suffocating marine 
life below. In reality, the web of feedback 
loops around primary production is much 
more complex, but a key point is that the 
reinforcing and balancing feedbacks op- 
erate on very different time scales, which 
determines the outcomes. Even though 
phytoplankton prove resilient to climate 
extremes, a full recovery after marine eco- 
system disturbance may be a long wait. 

T cell immune responses deciphered


A machine-learning approach reveals antigen encoding that predicts T cell responses


By Armita Nourmohammad¹,²,³


An immune response involves a
coordinated orchestra of
antigen-recognizing cells (e.g., T cells) and
signaling molecules to mount a 
specific response against a patho- 
gen. Although systems immunology 
offers a growing list of molecular interac- 
tions that are involved in antigen-specific 
immune responses, an understanding of 
how a response is mediated by different 
antigen characteristics is still lacking. On 
page 880 of this issue, Achar et al. (1) ad-
dress this question by using a robotic plat- 
form to survey a broad range of functional 
T cell responses to different antigen stimu- 
lations. Using machine learning, they con- 
struct a simplified map that separates six 
different stereotypical classes of antigen-de- 
pendent immune responses. Understanding 
this antigen-encoding could help guide im- 
munotherapy, including engineering chi- 
meric antigen receptor (CAR)-T cells and 
identifying vaccine antigens. 
Discriminating between an organism’s 
self-molecules and foreign (nonself) an- 
tigens is the hallmark of adaptive immu- 
nity. To achieve such specificity, the cur- 
rent model of T cell development in the 
thymus proposes that cells with very high 
reactivity to self-molecules (strong ago- 
nists) should be negatively selected, those 
with no reactivity (nonagonists) should 
die from neglect, and those with moderate 
reactivity (weak agonists) should be posi- 
tively selected and enter peripheral tissues 
(2). This classification has led to the con- 
cept of antigen quality as a predictor for 
the efficacy of adaptive immune responses. 
Antigen-specific immune responses are 
highly sensitive to even a small amount 


1Department of Physics, University of Washington, Seattle,
WA, USA. 2Fred Hutchinson Cancer Research Center, Seattle,
WA, USA. 3Max Planck Institute for Dynamics and Self-
Organization, Göttingen, Germany. Email: armita@uw.edu




GRAPHIC: V. ALTOUNIAN/SCIENCE 


of foreign antigens. To achieve 
such a degree of sensitivity 
and specificity, T cells sense 
the level of immune activity in 
their environments (quorum 
sensing) through interactions 
with signaling molecules and 
use this collective information 
to gauge the severity of a threat 
associated with an antigen 
(3). As a result, a mechanistic 
model of antigen-specific im- 
mune responses could involve 
a large number of cellular and 
molecular interactions with 
feedback, which are at least 
partially unknown. 

However, complexity at 
the interaction level does not 
necessarily imply a lack of 
simplicity at a higher (coarse- 
grained) level. Indeed, simplic- 
ity is an emergent property of 
many biological systems with 
strongly interacting parts. This 
is reflected in the predictabil- 
ity of coarse-grained molecu- 
lar characteristics in rapidly 
evolving populations (4, 5) or 
reproducibility in the structure 
of complex ecological commu- 
nities (6). Achar et al. took a 
similar point of view to gener- 
ate a predictive coarse-grained 
model of antigen-specific im- 
mune responses to pathogens. Their top- 
down approach is data-driven and enabled 
by the robotic platform with which they 
trace many immune response profiles (i.e., 
dynamics of different cytokine signaling 
molecules) in mouse and human T cell 
cultures upon exposures to different an- 
tigens (see the figure). The key to gener- 
ating such a model is to find an effective 
representation of an immune response. 
A powerful technique in machine learn- 
ing is to produce reduced (latent) repre- 
sentations from models trained on large 
amounts of data and relate these high-level 
representations to identifiable features 
(7). For example, in deep-learning models 
of facial recognition, latent representa- 
tions correspond to the eyes and nose in 
an image. 

Achar et al. found a latent representa- 
tion that separates immune response pro- 
files based on the quality of the presented 
antigens, independent of their quantity. 
This observation implies that equilibrium 
binding association and dissociation of a 
T cell receptor (TCR) and an antigen can- 
not be a good proxy for antigenicity—in 
equilibrium, a high binding probability of a 
TCR to an antigen can be achieved even for 

[Figure: Antigen quality predicts T cell response. T cells can detect small amounts of pathogen-derived antigens and respond by proliferating and secreting different signaling molecules. A neural network trained on the complex profiles of signaling molecules over time reveals a simplified map that separates immune responses into six different classes, dependent on the quality of the presented antigens. Panel labels: T cell proliferation and secretion of signaling molecules; complex profiles of signaling molecules (gray) in response to different antigens; learning a reduced representation of the T cell response.]

low-affinity antigens if they are available 
at large quantities. Indeed, antigen recog- 
nition by T cells goes beyond equilibrium 
binding and involves kinetic proofreading 
mechanisms, whereby two or more antigen 
recognition events are combined to assure 
the fidelity of a response (i.e., the interac- 
tions are kinetically proofread) (8-10). This 
mechanism is particularly crucial because 
self-antigens are present at much higher 
concentrations than nonself antigens, and 
their dissociation time from TCRs is only 
a few seconds shorter than that of nonself 
antigens (i.e., they have comparable bind- 
ing constants). 
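
As a rough numerical illustration of why equilibrium binding alone cannot separate antigen quality from quantity, the sketch below compares equilibrium receptor occupancy with a textbook multi-step kinetic proofreading scheme. It is not the model of Achar et al.; the rate constants, concentrations, and number of proofreading steps are hypothetical values in arbitrary units, chosen only so that the two ligands reach the same equilibrium occupancy.

```python
import math

# All numbers below are hypothetical, in arbitrary units, for illustration only.
K_ON = 1.0      # assumed common association rate
K_P = 1.0       # assumed rate of each proofreading (modification) step
N_STEPS = 4     # assumed number of sequential proofreading steps


def equilibrium_occupancy(conc, k_off, k_on=K_ON):
    """Fraction of receptors bound at equilibrium: C / (C + Kd), with Kd = k_off / k_on."""
    kd = k_off / k_on
    return conc / (conc + kd)


def proofreading_success(k_off, k_p=K_P, n_steps=N_STEPS):
    """Probability that a bound complex completes n_steps modification
    steps before the ligand dissociates (each step competes with unbinding)."""
    return (k_p / (k_p + k_off)) ** n_steps


# A scarce ligand with a long dwell time versus an abundant one with a short dwell time.
long_dwell = {"conc": 1.0, "k_off": 0.1}
short_dwell = {"conc": 10.0, "k_off": 1.0}

occ_long = equilibrium_occupancy(**long_dwell)
occ_short = equilibrium_occupancy(**short_dwell)

# Proofread signal: occupancy weighted by the chance of finishing all steps.
signal_long = occ_long * proofreading_success(long_dwell["k_off"])
signal_short = occ_short * proofreading_success(short_dwell["k_off"])
signal_ratio = signal_long / signal_short

print(f"equilibrium occupancy: {occ_long:.3f} vs {occ_short:.3f}")
print(f"proofread signal ratio (long/short dwell): {signal_ratio:.1f}")
```

Under these assumed values the two ligands are indistinguishable by equilibrium occupancy, whereas the proofread signal favors the long-dwelling ligand by roughly an order of magnitude, which is the qualitative point made in the text.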

The structure of the inferred latent rep- 
resentation reflects the amount of informa- 
tion encoded by biologically plausible im- 
mune profiles. Achar et al. found that the 
inferred latent representation can associ- 
ate immune response profiles to six differ- 
ent classes of antigens, which goes beyond 
the three conventional antigen classes of 
strong agonists, weak agonists, and nona- 
gonists. It remains to be seen how these 
six classes are related to the biologically 
meaningful molecular features of antigens. 

Learning a latent representation from 
high-dimensional data can also be used to 




identify relevant parameters 
for a system, allowing predict- 
ability (12). The artificial intel- 
ligence (AI)-guided approach 
used by Achar et al. has likely 
identified the appropriate repre- 
sentation in which the profile of 
an immune response can be pre- 
dicted by a simple model with 
few parameters. The remaining 
step is to find a biological inter- 
pretation for these parameters, 
which would enable experimen- 
tal manipulation of the system, 
whereby a molecular environ- 
ment (e.g., availability of differ- 
ent cytokines) could be designed 
to elicit a desired immune re- 
sponse from a given antigen. 
The importance of AI-guided 
design of immune and an- 
tigen environments for bio- 
medical interventions cannot 
be overstated. A controlled 
manipulation of an immune 
environment could enhance 
the success of an anticancer 
CAR-T cell immunotherapy by 
enabling a neoantigen derived 
from a patient’s tumor to elicit 
a desired immune response. 
The model can also inform 
antigen design protocols for 
immunotherapies and vaccine 
development. In addition, such 
a top-down data-driven approach to mod- 
eling the immune system provides a frame- 
work for constructing tractable theories for 
complex biological puzzles, from cracking 
the code of cell-fate differentiation during 
development to community assembly and 
resource allocation in ecological settings. 
Can internet use become addictive?


Problematic internet use parallels drug addiction, but the mechanisms are not yet clear 


By Matthias Brand


Although the internet provides many
opportunities, there is a downside,
which is that some people suffer
from diminished control over their
use of certain internet applications.

There are reports of young people 
gaming for days without rest, adequate 
hydration and nutrition, or social contact 
outside of gaming. This has raised concerns 
about problematic internet use that results 
in functional impairment and 
distress in daily life. The inter- 
net applications that are often 
used problematically deliver 
pleasure and enable the reduc- 
tion of negative mood. These 
responses show parallels with 
the effects of addictive drugs 
on the brain’s reward system. 
Additionally, compulsive us- 
age patterns may develop. Self- 
control is an important factor 
in whether pleasure and com- 
pulsion to use certain internet 
applications can drive problem- 
atic use. Given the increased 
use of the internet in this era 
of social distancing and remote 
working, it is important to as- 
sess harm caused by online ad- 
dictive behaviors. 

In the 11th revision of the 
International Classification of 
Diseases (ICD-11) by the World 
Health Organization, effective 
from January 2022, two dis- 
orders due to predominantly 
online addictive behaviors can 
be diagnosed: gambling and 
gaming disorders. Other poten- 
tially problematic online behaviors are be- 
ing debated: problematic pornography use 
may be considered a subtype of compul- 
sive sexual behavior disorder in the ICD-11, 
whereas problematic social-networks use 
and problematic buying-shopping can be 
diagnosed as “other specified disorders due 
to addictive behaviors” (1). Globally, up to 
3% of adolescents and young adults suffer 
[Figure label: Becoming addicted. The “feels better” path is probably central to early stages of the addiction process. In later stages, the “must do” path becomes involved.]


from gaming disorder (2), and ~7% of gen- 
eral populations have unspecified internet- 
use disorders (3). For gaming disorder, be- 
ing younger and male increases the risk, 
but the choice of diagnostic instruments 
causes substantial variation in estimating 
prevalence (2), making it necessary to es- 
tablish a standard diagnostic assessment. 
Negative consequences of online addictive 
behaviors include, for example, neglect of 
school or professional duties, abandon- 
ment of hobbies or leisure activities, and 


[Figure: Neural pathways in addiction. The hypothesized main neural circuits in human addictive behaviors are shown. The “feels better” path includes positive and negative reinforcement experiences and involves the ventral striatum (nucleus accumbens), and the dorsal striatum in later stages. The “must do” path includes primarily compulsive behaviors, involving the dorsal striatum (putamen and caudate nucleus). The “stop now” self-control process mainly involves the dorsolateral prefrontal cortex.]

jeopardizing social relationships. Given in-
creased internet use, it is important to bet- 
ter understand resilience and risk factors 
that may distinguish between a functional- 
enriching and a problematic-addictive us- 
age of the internet. Moreover, although the 
clinical relevance of gaming disorder has 
been established, there are still many un- 
answered questions regarding etiology and 
neurobiological mechanisms. 

There are parallels among the inter- 
net applications that are used addictively. 
They probably target evolutionary precon- 
ditioned reward mechanisms (related to 
behaviors that affect survival, such as sex,
acquisition of goods, social connectedness, 
and discovery) and therefore deliver plea- 
sure, resulting in positive reinforcement of 
the behavior (4). In addition, application 
usage can also cause negative reinforce- 
ment, such as avoidance of negative moods, 
which is also known from substance use 
(e.g., using alcohol to cope with stress). The 
reward and relief mechanisms may explain 
why specific internet applications are more 
likely to be used addictively than others, but 
they do not explain why some 
individuals develop addictive 
use, whereas most users do not 
and are able to use these appli- 
cations in an enriching manner. 

Current theoretical consider- 
ations of the development and 
maintenance of online addic- 
tive behaviors (5) are mainly 
based on theories and findings 
from substance-use research, 
including those related to di- 
minished self-control (6), as 
well as hypotheses on the potential
shift from reward-oriented to
compulsive behaviors (7). There
are two driving paths to online
addictive behaviors that may
increase the engagement in
online activities: the “feels better”
and the “must do” paths (see the
figure). The “feels better” path
includes both posi-
tive (pleasure, reward) and neg- 
ative (reduction of stress and 
negative mood) reinforcement 
experiences and related crav- 
ings and desires. The “must do” 
path (which probably occurs 
later in the course of becom- 
ing addicted) includes habitual behaviors 
(automatic responses to stimuli), and espe- 
cially compulsive behaviors (7). Compulsive 
behaviors describe the maladaptive persis- 
tence of a behavior despite being aware of 
its negative consequences. A “stop now” 
process, constituted by the exertion of self- 
control (including inhibitory control and 
other executive functions), may regulate the 
two driving paths to addiction. The main 
hypothesis expressed in this simplified 
model is that individuals with online addic- 
tive behaviors have an imbalance between 
the driving paths and self-control. 


Recent neuroimaging and neurocognitive
studies have intensively investigated po- 
tential behavioral and neural correlates of 
cue reactivity and craving in online addic- 
tive behaviors and have demonstrated the 
involvement of both the ventral and dorsal 
striatum [for gaming disorder (8), problem- 
atic pornography use, social-networks use, 
and buying-shopping (1)]. Furthermore,
meta-analyses of cognitive functions in 
individuals with unspecified internet-use 
disorders have found evidence for dimin- 
ished decision-making and other executive 
functions related to the prefrontal cortex 
(9). This is consistent with a meta-analysis 
of structural brain changes in unspecified 
internet-use disorders showing gray mat- 
ter volume reductions in the dorsolateral 
prefrontal cortex, anterior cingulate gyrus, 
and supplementary motor area (10). Meta- 
analyses on resting-state functional connec- 
tivity (11) for adolescents with gaming disor-
der have demonstrated hypoconnectivities 
related to limbic-frontal and fronto-parietal 
networks (among others), which may in- 
dicate blunted self-control in combination 
with increased reward sensitivity. The avail- 
able findings, which are not exhaustive and 
not always consistent, give preliminary sup- 
port for parallels between online addictive 
behaviors and substance-use disorders. 

However, there are still many challenges 
in understanding the psychological and 
neurobiological mechanisms involved in 
online addictive behaviors. In particular, 
vulnerability, causality, and specificity are 
considered important. Regarding poten- 
tial vulnerability, a genetic predisposition 
for online addictive behaviors has been re- 
ported, and polymorphisms (in most cases 
single-nucleotide polymorphisms) in can- 
didate genes related to, for example, dopa- 
mine receptors, serotonin transporters, and 
nicotinic acetylcholine receptors have been 
identified (12). These neurotransmitters,
most prominently dopamine, are involved 
in prefrontal-striatal loops, including the 
ventral and dorsal striatum. Further poten- 
tial risk factors include negative early child- 
hood experiences and insecure attachment 
style, personality traits (e.g., impulsivity), 
dysfunctional emotion regulation, stress 
vulnerability, low social support, and prob- 
lematic family background. Self-control, 
assessed through questionnaire, has been 
found to be the most reliable protective fac- 
tor for gaming disorder in a recent meta- 
analysis (13). These potential risk and resil- 
ience factors, however, are not specific for 
online addictive behaviors. They are also in- 
volved in drug addictions (14) and in other
mental disorders [e.g., depression, anxiety, 
and attention deficit and hyperactivity dis- 
order (ADHD)], which frequently co-occur 
with online addictive behaviors. Identifying
more specific vulnerability factors and de- 
fining how risk and resilience factors may 
determine reward and relief experiences 
related to the use of specific internet appli- 
cations may help to differentiate between 
enriching and problematic usage patterns. 
Another major challenge is answering 
the question of causality related to brain 
mechanisms involved in online addictive 
behaviors, specifically the proposed imbal- 
ance between driving paths and self-con- 
trol. The structural and functional brain ab- 
normalities reported to be associated with 
gaming disorder and other online addictive 
behaviors may arise from neural plasticity 
and adaptations that occur while addiction 
is forming, but they may also be present 
earlier. Reduced prefrontal functioning may 
be a vulnerability, but may decrease further 
when limbic circuits become stronger in 
the course of the addiction process. Because 
most studies compare individuals with and 
without online addictive behaviors or are 
correlational, the current knowledge about 
causality is limited. Studies with prospec- 
tive designs and with, for example, siblings 
(with and without gaming disorder or other 
online addictive behaviors), as has been 
done in the context of stimulant-use disor- 
der (15), are needed to discover the neuro- 
biological and psychosocial vulnerabilities 
and causality of online addictive behaviors. 
Regarding specificity, many studies used 
paradigms from research on substance- 
use disorders and investigated potential 
parallels, which was theoretically most ap- 
propriate and also necessary to justify the 
inclusion of gaming disorder into the ICD- 
11 as a disorder due to addictive behaviors. 
A challenge for future studies is to iden- 
tify the neurobiological and psychological 
mechanisms that are specific to online 
addictive behaviors. Systematic compari- 
sons with impulse-control disorders and 
obsessive-compulsive disorders are needed 
to evaluate the involvement of impulsivity 
and compulsivity as potential transdiag- 
nostic features. Internet-application-spe- 
cific features and how they affect behavior 
should also be investigated more system- 
atically, such as “winning elements” when 
finding a product on shopping sites, a 
pornographic video that fits best with in- 
dividual preferences, or receiving likes on 
social networks, and should be compared 
with mechanisms involved in gambling 
disorder. Internet applications that are 
used addictively are designed to keep users 
using them. Notably, and unlike addictive 
substances, the applications are adaptive 
to usage patterns, provoking individual 
reinforcement experiences. This may fur- 
ther contribute to strengthening the “feels 


better” and “must do” driving paths, po- 
tentially overriding self-control while de- 
veloping online addictive behaviors. 

A deeper analysis of application-specific 
features and how these influence the user’s 
motives (e.g., discovery, need to belong, 
materialistic values) may improve under- 
standing of vulnerabilities and causalities. 
It is unclear whether online addictive be- 
haviors are developed because of a general 
tendency to be addicted or whether the on- 
line environments are specifically addictive. 
Assessing general risk and resilience factors 
in combination with symptoms of several 
addiction disorders and application-specific 
features may help disentangle general from 
specific addiction mechanisms. This may 
then also answer the question of whether 
different types of online addictive behaviors 
should be considered independent entities 
(like gaming disorder), one group of prob- 
lematic internet use, or the online type of a 
specific disorder (e.g., problematic pornog- 
raphy use as an online variant of compul- 
sive sexual behavior disorder). 

Online addictive behaviors are not an indi- 
vidual choice but may be a consequence of dif- 
ficulties in making advantageous choices and 
regulating rewarding behaviors. To optimize 
prevention and treatment of these emerging 
mental health conditions, more global collab- 
oration networks among research institutes 
are needed to realize large-scale multi-meth- 
odological longitudinal studies. Of particu- 
lar interest should be if and how societal 
changes (e.g., increased internet use since the 
pandemic) will stably affect prevalence rates 
of online addictive behaviors. 
Electric buzz in a glass of pure water


Hydrogen bond charge transfer in water may have far-reaching chemical implications 


By Dor Ben-Amotz 


Water is far more than a collection
of neutral H₂O molecules, linked
by hydrogen bonds, with a trace
of hydrogen (H⁺) and hydroxide
(OH⁻) ions. Indeed, recent work

has revealed that pure water has 
an additional shimmering buzz arising 
from electron transfer between neighbor- 
ing water molecules, in which the hydro- 
gen atom on one water molecule forms a 
hydrogen bond with a neighbor’s oxygen 
atom (1-4). This charge transfer, and the
resulting attraction between the pair of op- 
positely charged water molecules, contrib- 
utes to the strength of the hydrogen bonds 
among water molecules (4). Hydrogen- 
bonding defects in liquid water lead to 
the accumulation of negative or positive 
charge on water molecules with an odd 
number of hydrogen bonds (2-4). These 
charged waters may play an overlooked 
role in chemical reactions at the air-water 
and oil-water interfaces that are ubiqui- 
tous in biological, geological, and environ- 
mental chemistry. 

The importance of hydrogen bonds in 
dictating the structure of water, as well as 
the presence of H⁺ and OH⁻, has
been well understood since the 
early half of the 20th century. 
This understanding includes 
the propensity for each water 
molecule to donate two and ac- 
cept two hydrogen bonds, and it 
is precisely these four hydrogen 
bonds that give ice its tetrahe- 
dral geometry. However, a par- 
tial breakdown of this geometry 
is what makes ice float on water, 
whose slightly denser structure 
includes water molecules that 
form an odd number of hydro- 
gen bonds. 

It has recently become clear 
that there is something big 
missing in this long-standing 
view of the structure of water. 
Specifically, recent work implies 
that water must also be teeming 
with additional, previously un- 
recognized, charged water molecules (1-4).
The formation of these charged waters is a 
consequence of the quantum mechanical 
delocalization of electrons, which allows 
an electron on one water molecule to stray 
onto its neighbor. The resulting transfer 
of charge between hydrogen-bonded wa- 
ter molecules is supported by both theo- 
retical predictions (2-4) and experimental 
evidence of charge buildup at an oil-water 
interface resulting from water charge trans- 
fer (1). This transfer of electrons between 
H₂O molecules is akin to the phenomenon
known as quantum superposition (see the
figure). Like Schrödinger’s cat, which is
both dead and alive, a pair of hydrogen-
bonded water molecules can be either
charged (i.e., as H₂O⁺···H₂O⁻) or uncharged
(i.e., as H₂O···H₂O). The charged waters are
formed when the hydrogen bond acceptor
transfers an electron to the hydrogen bond
donor. This charge transfer has a predicted
probability of at least 2% (2, 3), which cor-
responds to an average charge exchange of
±0.02 e for every hydrogen bond in liquid
water. The resulting attraction between the 
positively and negatively charged waters 
increases the strength of all the hydrogen 
bonds in liquid water (4). 


[Figure: Fleeting charge transfer and longer-lived ions in water. Hydrogen bond charge transfer generates a fleeting positive (+) or negative (−) charge on water molecules. This may give rise to longer-lived partially charged water molecules (H₂O^δ+ and H₂O^δ− with δ = 0.02) in equilibrium with hydrogen (H⁺) and hydroxide (OH⁻) ions, and perhaps also hydrated electrons (H₂Oe⁻) and water radical cations (H₂O•⁺).]

These charge transfers can also influence
the average charge of individual water mol-
ecules. For most water molecules, which
have an equal number of hydrogen bond
donors and acceptors, the average charge is
zero. However, 30% of the molecules in liq-
uid water have an odd number of hydrogen
bonds, with either two donors and one ac-
ceptor, or two acceptors and one donor. This
hydrogen bond imbalance leads to a persis-
tent average charge of ±0.02 e on the water
molecules with an odd number of hydrogen
bonds (4). Because water has a concentra-
tion of 55.5 moles per liter (M), this implies
that these partially charged water mol-
ecules have a tremendously high concentra-
tion of ~17 M (from 55.5 M × 30%) (4). As
electron charges are indivisible, a partial
charge of ±0.02 e on a water molecule, in
reality, means that this water molecule
has a 2% chance of carrying a full charge
of ±e. Thus, the 17 M concentration of par-
tially charged water molecules implies that,
at any instant, pure water contains 0.3 M
of fully charged water molecules (from
17 M × 2%).
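
The arithmetic in this paragraph can be checked in a few lines. The sketch below uses only the three figures quoted in the text (55.5 M, 30%, and a 2% charge probability); the variable names are illustrative and not taken from the cited work.

```python
# Figures quoted in the text; variable names are illustrative only.
WATER_MOLARITY = 55.5          # mol/L of H2O in liquid water
FRACTION_ODD_HBONDS = 0.30     # share of molecules with an odd number of hydrogen bonds
CHARGE_PROBABILITY = 0.02      # chance that such a molecule carries a full charge

# Concentration of partially charged water molecules (the text rounds this to ~17 M).
partially_charged_molar = WATER_MOLARITY * FRACTION_ODD_HBONDS

# Instantaneous concentration of fully charged molecules (the text rounds this to 0.3 M).
fully_charged_molar = partially_charged_molar * CHARGE_PROBABILITY

print(f"partially charged: {partially_charged_molar:.2f} M")
print(f"fully charged at any instant: {fully_charged_molar:.3f} M")
```

The unrounded product is slightly above 0.3 M; the text carries the rounded 17 M forward, which gives essentially the same value.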

This instantaneous concentration of
H₂O⁺ and H₂O⁻ is about a million times
higher than the 10⁻⁷ M equilibrium con-
centration of H⁺ and OH⁻ in
pure water. This implies that
the trace amount of H⁺ and OH⁻
is in equilibrium with a million
times higher concentration of
charged water molecules. This
further suggests that there is a
probability of roughly one in a
million (0.0001%) of converting
these charged waters to H⁺ and
OH⁻. Converting H₂O⁺···H₂O⁻
to H₃O⁺···OH⁻ requires only a
slight displacement of a hydro-
gen atom from the hydrogen
bond donor toward the accep-
tor (note that the hydronium
ion, H₃O⁺, is formed by attach-
ing an H⁺ ion to a neutral H₂O
molecule). Thus, hydrogen
atom transfer provides a pos-
sible mechanism that converts
charged water molecules to the
trace amount of H⁺ and OH⁻ in
pure water (5). Moreover, the
experimentally observed in-
crease in the concentrations of
H⁺ and OH⁻ in water with in-
creasing pressure and temperature implies
that the hydrogen atom transfer probabil-
ity increases with increasing water den-
sity and thermal energy. In other words,
although the presence of H⁺ and OH⁻ in
pure water has long been known, the pos-
sibility that these ions may be generated by
the much higher concentration of charged
water molecules has not been considered.
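
The "million times" comparison is an order-of-magnitude statement, which the following sketch makes explicit. It takes the 0.3 M and 10⁻⁷ M values from the text; the variable names are illustrative, and the implied conversion probability is simply the inverse ratio, not an independently measured quantity.

```python
import math

# Values quoted in the text; names are illustrative only.
CHARGED_WATER_MOLAR = 0.3   # instantaneous concentration of fully charged water molecules
ION_MOLAR = 1e-7            # equilibrium concentration of H+ (and of OH-) in pure water

# How many times more concentrated the charged waters are than the ions.
ratio = CHARGED_WATER_MOLAR / ION_MOLAR

# Nearest power of ten, i.e. the order of magnitude the text refers to.
order_of_magnitude = round(math.log10(ratio))

# Implied fraction of charged waters that would need to convert to ions.
conversion_probability = ION_MOLAR / CHARGED_WATER_MOLAR

print(f"ratio: {ratio:.1e} (about 10^{order_of_magnitude})")
print(f"implied conversion probability: {conversion_probability:.1e}")
```

The exact ratio is a few million rather than one million, so "one in a million (0.0001%)" should be read as an approximate scale rather than a precise probability.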

The quantum superposition of 
H₂O···H₂O and H₂O⁺···H₂O⁻ also implies
a rapid charge fluctuation with an on-off 
blinking rate on the order of a thousand 
terahertz—estimated by assuming the 
round-trip time of an electron bouncing 
between two neighboring water molecules. 
Although these fleetingly charged waters 
are very short-lived, they bear a striking re- 
semblance to much longer-lived H₂O⁺ and
H₂O⁻ ions. Long-lived H₂O⁺ has recently
been discovered in charged water drop-
lets and found to be chemically reactive
(6). Long-lived H₂O⁻ is closely related to a
“hydrated electron,” whose structure has 
long been debated, with the current con- 
sensus being that it can best be described 
as an extra electron partially bound to a 
water molecule. However, the fleetingly 
charged waters, which constantly blink in 
and out of existence, must be structurally 
and chemically distinct from their longer- 
lived relatives. This is because long-lived 
ions contribute to electrical conductivity 
and hydrated electrons have a blue color. 
Thus, the experimental conductivity and 
optical transparency of water imply that 
these long-lived H₂O⁺ ions and hydrated
electrons have lower concentrations than
H⁺ and OH⁻.

Water charge transfer has additional im- 
plications for interfacial water chemistry, 
including that occurring at macroscopic 
air-water or oil-water interfaces and at 
the surfaces of molecules such as alcohols 
and some amino acids dissolved in water. 
At all of these interfaces, there is an in- 
crease in the number of water molecules 
with an odd number of hydrogen bonds, 
and thus a buildup of charged layers near 
the water surface, as well as charge trans- 
fer from water to oil (1, 2). The resulting
layer of excess negative charge near the
surface is consistent with the observed
negative charge of air bubbles and oil
droplets (1, 2). Although the interpretation
of such experimental evidence remains 
a subject of debate, there is no escap- 
ing the intimate link between hydrogen 
bonding, charge transfer, and the build- 
ing up of charge layers at air-water and 
oil-water interfaces (2, 4), as well as the 
following broader chemical implications. 

Chemical reactivity at the surface of wa- 
ter is a subject of intense current interest, 
fueled by the discovery of up to a million-
fold increase in the rates of some reactions 
at the surface of water compared with the 
rates in water (7). The buildup of charges 
near the surface, resulting from the greater 
number of charged water molecules with 
an odd number of hydrogen bonds, has 
been proposed as a mechanism for the ob- 
served reactivity changes (8). 

The buildup of charge near the surface 
of water may also affect chemical equilib- 
ria involving H⁺, OH⁻, and other ions, and
influence the solubilities of oily molecules
in water. Although the affinities of H⁺ and
OH⁻ for air-water and oil-water interfaces
remain controversial (5), surface tension
and solubility experiments imply that such
interfaces tend to expel OH⁻ and attract
H⁺. Specifically, adding OH⁻ ions to water
increases its surface tension and decreases
the solubility of oily molecules, such as ben-
zene, whereas adding H⁺ ions has the op-
posite effect. These observations indicate
that the interfacial concentration of OH⁻ is
lower than that in bulk water, while the
opposite is true for H⁺, which implies that
H⁺ has a slightly higher concentration near
air-water and oil-water surfaces. The affin-
ity of H⁺ for such surfaces is apparently con-
sistent with the attraction between H⁺ and
the negative surface charge layer induced 
by charge transfer. However, there may be 
other contributing factors, as simulations 
that did not consider charge transfer have 
also predicted that an oily methane mol- 
ecule repels small positive ions less than 
small negative ions (9). 

All interfacial reactivity changes, includ- 
ing changes in reaction rates and product 
yields, are linked to the propensity of the 
reactants, intermediates, and products to 
adsorb onto the surface. Such adsorption 
differences may depend on various chemi- 
cally specific mechanisms (10), including 
stabilization of reaction intermediates 
resulting from the same interfacial im- 
balance of hydrogen bond donors and ac- 
ceptors that is responsible for interfacial 
charge layering (11). More general argu-
ments indicate that adsorption should be 
more favorable for neutral than charged 
molecules, and for larger molecules or 
ions than smaller ones (12). The resulting 
high interfacial concentrations of neutral 
molecules or large molecular ions may 
greatly increase reaction rates, even if the 
associated activation energies required 
for the reaction to occur are unchanged. 
Additionally, the activation energy may de- 
crease, and thus rate constants increase, at 
surfaces—particularly for reactions involv- 
ing intermediates that are physically larger 
than the corresponding reactant species. 
This expectation is consistent with recent
observations of the interfacial acceleration
of bimolecular, but not unimolecular, reac- 
tions (13), as bimolecular reactions neces- 
sarily have intermediates that are larger 
than the corresponding reactants. 
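
The two routes to faster interfacial reactions described above can be put into rough numbers. The sketch below assumes simple mass-action kinetics (rate = k[A][B]) and an Arrhenius rate constant; the enrichment factors, prefactor, and activation energies are hypothetical values chosen only for illustration and are not taken from the studies cited here.

```python
import math

R = 8.314  # gas constant, J/(mol*K)

def arrhenius(prefactor, activation_energy, temperature):
    """Rate constant k = A * exp(-Ea / (R*T)), with Ea in J/mol."""
    return prefactor * math.exp(-activation_energy / (R * temperature))

def rate_speedup(enrichment_a, enrichment_b=None):
    """Speed-up from surface enrichment alone, with k unchanged.

    One factor: rate = k[A].  Two factors: rate = k[A][B].
    """
    return enrichment_a if enrichment_b is None else enrichment_a * enrichment_b

# Hypothetical numbers for illustration only.
T = 298.0
k_bulk = arrhenius(1e10, 60_000.0, T)
k_surface = arrhenius(1e10, 55_000.0, T)  # assumed 5 kJ/mol lower barrier

concentration_effect = rate_speedup(10.0, 10.0)  # both reactants enriched 10x
barrier_effect = k_surface / k_bulk              # change in rate constant only

print(f"speed-up from enrichment alone: {concentration_effect:.0f}x")
print(f"speed-up from lower barrier alone: {barrier_effect:.1f}x")
```

Under these assumptions the two effects multiply, so even a modest surface enrichment of both reactants can outweigh a small change in activation energy.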

A detailed understanding of the chemi- 
cal implications of water’s electrical buzz 
is only now beginning to emerge. There is 
no doubt that this buzz strengthens hydro- 
gen bonds and produces a high concentra- 
tion of partially charged water molecules. 
However, the precise concentrations of 
charged waters and the resulting interfa- 
cial charge layering have yet to be deter- 
mined. The influence of water’s electric 
buzz on interfacial chemistry is an even 
newer subject whose details and implica- 
tions remain to be explored. This is par- 
ticularly true of the similarities between 
chemical reactivity at both macroscopic 
interfaces and the molecular surfaces of 
solutes dissolved in water. Such similari- 
ties imply that the recently discovered ac- 
celeration of the rates of some reactions at 
macroscopic air-water interfaces (7) may 
also occur inside of water that contains 
oily molecules, as is the case in biological 
systems. Additionally, observations of the 
effect of oil chain-length on ion-oil inter- 
actions (14) and the effect of the concentra- 
tion of oily molecules on water structure 
(15) raise questions regarding the influ- 
ence of interfacial curvature and crowding 
on water charge transfer and chemical re- 
activity. Although the answers to many of 
these questions, as well as their practical 
implications, remain to be discovered, the 
ubiquity of water and aqueous interfaces 
leaves little doubt that there is much to be 
gained from such pursuits. 

Law, policy, biology, and sex:
Critical issues for researchers 


Researchers should be aware of how sex-difference science 
is (mis)applied in legal and policy contexts 


By Maayan Sudai, Alexander Borsa,
Kelsey Ichikawa, Heather Shattuck-Heidorn,
Helen Zhao, Sarah S. Richardson


The interplay between legal and bioscientific
understandings of sex is prolific
and complex. Biological evidence
and reasoning circulate in lawmaking
and policy-making across an array of
politically contested issues, including
health care, education, and LGBTQI+ rights 
and protections. There is often a substantial 
disjoint, however, between how scientists 
define and operationalize sex differences in 
their research and how lawmakers and pol- 
icy-makers make sense of these definitions 
and concepts as they strategically seek to bol- 
ster or challenge legal governance. Medical 
and life scientists who routinely incorporate 
sex-related variables in their research cannot 
eliminate superficial or malicious misuse of 
research by lawmakers and policy-makers, 
but awareness of the legal and policy land- 
scape can clarify the possible downstream 
consequences of researchers’ choices about 
how to operationalize sex-related variables in 
their studies. 

When lawmakers and policy-makers are 
required to define “sex” in rules and policies, 
they often rely on definitions found in profes- 
sional literature from the life and health sci- 
ences. As an example, the US Department of 
Health and Human Services (HHS) proposed 
a rule in 2019 to amend the anti-discrimina- 
tion section in the Affordable Care Act (ACA) 
to allow discrimination on the basis of gender 
identity or sexual orientation. The proposed 
rule referenced biomedical research articles 
on brain, liver, and cardiovascular diseases 
that presented sex differences as strictly biological
and binary. In other cases, lawmakers
and policy-makers build on the authority of 
scientists and on assertions about “proven 
science” more generally. A proposed bill in 
2019 to prohibit gender-affirming care under 
the age of 18 in the state of Illinois stated that 
“Scientists and other medical professionals 
have recognized that biological sex is a neu- 
tral, objective, and immutable fact of human 
nature.” Many states have proposed or passed 
laws that restrict the definition of sex to spe- 
cific biological criteria such as “anatomy” and 
“genetics” at the time of birth. Policy-makers 
and legislators have heavily relied on bio- 
logical definitions of sex when making laws 
that seek to limit access to public bathrooms 
on the basis of sex assigned at birth. These 
laws define sex as determined by “genetics,”
a “person’s chromosomes,” “anatomy,” or “as
biologically defined” (1).

Social movement leaders and public ad- 
vocates who participate in legal and public 
debates also rely on medical and scientific 
statements when writing court amicus briefs, 
submitting comments on a rule, and advo- 
cating in other policy arenas. Such groups 
often use statements made by medical as- 
sociations and scientific studies about sex, 
gender, and sexuality to convey their agenda. 
For example, in the high-profile court case of 
Gavin Grimm, a trans boy who was barred 
from using the boys’ common bathroom at 
school, advocates for Grimm argued for an 
understanding of gender identity as innate 
and biological, citing medical and scientific 
literature about gender dysphoria (a term 
referring to persistent distress or discom- 
fort about one’s gender, often in relation to 
one’s sex assigned at birth or bodily charac- 
teristics) and studies or theories connecting 
gender and the brain (such as the concepts of 
“brain sex” or “biological gender”) (1, 2).

Using biological definitions in the law may 
sound like responsible governance, but it can 
result in harmful or illogical outcomes. This 
can occur through a variety of means, includ- 
ing taking biological sex categories out of 
context and applying them to cases of human
legal rights where they are not relevant; uncritical
use of binary sex categories in science;
and ignoring bioscientific evidence about 
the complexity, mutability, context specific- 
ity, and plurality of sex and gender. In light 
of such concerns, we outline here central 
debates over conceptualizing sex in relation 
to biology in the law. Although we focus on 
US law and advocacy, reference to biomedi- 
cal research on sex differences to limit or ex- 
pand legal recognition of gender and sexual 
minorities is an emergent global strategy 
employed by international governing bod- 
ies, religious institutions, and national leg- 
islators, reflected, for instance, in Hungary’s 
anti-transgender legislation passed in 2020. 

Understanding how their work may be in- 
terpreted by others can help scientists iden- 
tify areas where they can be more explicit 
about what assumptions or positions their 
work can, or cannot, support. These con- 
siderations are relevant for biomedical re- 
searchers and clinicians not only at the level 
of conducting and communicating basic and 
applied science, but also for ethically reflect- 
ing on the impact of scientific research on 
people’s opportunities to lead full, flourishing,
healthy lives.


CONTESTED AND COMPLEX 

In both science and law, sex is a contested
and complex category. Sex is a ubiquitous
category across the life and health
sciences, with data routinely tagged as male 
or female. But increasingly it is recognized 
in the biomedical sciences that a strict sex 
binary fails to describe the variation present 
in human biology across domains long as- 
sumed to be strongly sexed (3). From neuro- 
science (4) to endocrinology (5) to preclini- 
cal research (6), scientists are exploring the 
limitations of the sex binary as a categorical 
system to describe and explain the varia- 
tion present in humans. For example, in 
contrast to widespread beliefs that humans 
have “sexed” brains, recent work within 
neuroscience reveals that human brains 
do not exhibit binary sex. Even in brain re- 
gions that show slight differences between 
females and males in aggregate, individual 
brains are not “male” or “female” but rather 
mosaic, exhibiting mixtures of “female” and 
“male” characteristics (7). Some “sex differ- 
ences” in disease or health outcomes have 
been revealed to be accounted for by factors 
like age or body size (8, 9). This is not to 
say that studying sex differences does not 
retain usefulness in certain areas of inquiry. 
Biologists routinely use “sex” as a concept, 
and in some narrowly defined research 
areas, such as scientific studies of the re- 
productive tract, a binary works relatively 
well to describe the variation in humans. 
However, in many areas, the premise that 
sex deeply pervades most aspects of human
biology and cleaves humans into two essen- 
tial types is being deeply challenged within 
scientific research. 

When US law categorizes people according
to sex, it usually classifies them
as either male or female. Sex classifications
referencing biological criteria are
codified throughout state and federal laws
and regulations, sometimes in
surprising places. For example, in its 2014
regulation “Standards To Prevent, Detect, 
and Respond to Sexual Abuse and Sexual 
Harassment Involving Unaccompanied 
Children,” the federal Office of Refugee 
Resettlement (ORR) defines “sex” using 
strictly binary groupings of “sex chromo- 
somes, gonads, internal reproductive or- 
gans, and external genitalia.” 

US laws and policies vary across time, 
jurisdiction, and context, and so do defini- 
tions of sex in the law. For example, a re- 
cent landmark Supreme Court decision in 
Bostock v. Clayton County held that protection from
discrimination “on the basis of sex” in Title 
VII of the Civil Rights Act includes sexual 
orientation and transgender status. The 
HHS subsequently announced that they 
will apply the same interpretation to the 
ACA. Despite this, much legal policy continues
to reductively conceptualize “sex” as
binary and to source biological science as 
the basis for this determination. 


SEX IN LAW: THREE APPROACHES 

In the face of these complexities and con- 
testations, legal actors must make situated 
and often value-laden choices about the use 
of sex classifications. Here, we characterize 
three broad-stroke approaches to sex classi- 
fication in the law employed by lawmakers 
and policy-makers, which we call essentialist, 
abolitionist, and pluralist. Appreciating the 
distinctions between these approaches can 
inform scientists’ considerations in offering 
constructive guidance when their expertise 
is invited in a law or policy setting and can 
clarify decisions about how to include, op- 
erationalize, interpret, and translate findings 
regarding sex-related variables. 

An essentialist approach to sex assumes 
that sex is a strictly binary and essential 
property of people defined by biological sci- 
ence. Administrative classifications of sex, 
such as the ORR example given above,
exemplify the sex essentialist approach. 
Essentialist conceptions of sex are often used 
in US state anti-LGBTQI+ legal initiatives
aimed at determining who is permitted to
use which bathrooms or to play which youth 
sports, and to generally enforce and justify 
segregation in sexed spaces, such as prisons, 
locker rooms, and homeless shelters. Binary 
sex essentialist approaches frequently rely 
on selective biological claims about sex and 
invoke the authority of science. For example, 
a 2021 law passed in West Virginia requires 
classification of sports competitions accord- 
ing to “biological sex,” defined as “an indi- 
vidual’s physical form as a male or female 
based solely on the individual’s reproductive 
biology and genetics at birth.” 

In strong juxtaposition to essentialism is 
abolitionism, which considers sex classifica- 
tions to be an unremitting harm and aims to 
remove them from law and policy. Prominent 
examples of this stance in action include 
recent efforts to remove sex from official 
identification documents such as drivers’ 
licenses or birth certificates, to convert sex- 
based bathrooms into gender-neutral or all- 
gender ones, and to endorse mixed-sex sport 
competitions. Another form of abolitionism 
can be found in laws that aim to replace 
sex classifications with different, perhaps 
more relevant, criteria: Instead of giving
parenting-related benefits or special health
care to “mothers,” such laws give them to the
“primary family caregiver” or “pregnant person,”
or base them on lactation status (10, 11).

Such initiatives are often endorsed by med- 
ical experts and associations that attest to the 
complexity of gender and sex or to biologi- 
cal sex’s lack of necessity in the specific con- 
text. For example, in June 2021 the American 
Medical Association resolved to “advocate for 
the removal of sex as a legal designation on 
the public portion of the birth certificate,” 
arguing that this removal will not harm the 
collection of vital statistics needed for public 
health purposes and will help protect the pri- 
vacy of and prevent discrimination against 
intersex, trans, and nonbinary people. 

In contrast to both essentialism and abo- 
litionism is a pluralist approach. This ap- 
proach accepts that different contexts call 
for different definitions of sex. Pluralism 
is not essentialism, because it rejects that 
sex is a universally binary and biological 
property of individuals. Instead, definitions 
and conformations of sex-related categories 
are expected to vary according to the prag- 
matic needs of the policy or social context. 
But pluralism is also not abolitionism, be- 
cause it recognizes that the need for and 
characterization of sex classifications vary 
with context, and that in situated cases, sex 
classifications may be relevant and neces- 
sary—for example, to help extend rights and 
protections to vulnerable groups. 

Such policies create sex classifications 
that may change over time, perhaps retain- 
ing male and female but adding further cat- 
egories to accommodate those who do not 
fit the binary, or expanding the definition 
of males and females beyond biologically 
essentialist ones. Examples include policies 
that add a third gender/sex category, as seen 
in some identification documents. Dana 
Zzyym, an intersex nonbinary individual, re- 
cently became the first US citizen to receive 
a passport with an “X” gender/sex marker, 
designated for nonbinary, intersex, and gen- 
der-nonconforming people. Similarly, a plu- 
ralist approach may support sex classifica- 
tions for particular protected classes, such as 
in same-sex education or in sex-discrimina- 
tion statutes designed to protect vulnerable 
populations in the workplace. In this sense, 
some who may endorse abolitionism in the 
long run may choose a pluralist course in the 
short run. A pluralist approach means that 
in any particular case of gender or sex cat- 
egorization in law and policy, it is necessary 
to build an empirically grounded definition 
of sex specific to the context. 


RESPONSIBLE SCIENCE 

How might scientists who study sex-related 
variables incorporate within their own re- 
search practice an awareness and sensitivity 
to how their work may be taken up in these
debates? The first step is to recognize that 
scientific claims about sex are frequently 
and variously mobilized in a wider policy, 
regulatory, and administrative space, and 
that “sex” has different meanings across 
these realms and over time. For this reason, 
it is important to contextualize and appro- 
priately qualify uses of sex categories in 
research claims, including in publications, 
pedagogy, and engagement with the media, 
public, or policy-makers (12). 

Where possible, biomedical researchers 
should consider reducing unnecessary use 
of sex categories in the design, conduct, 
and interpretation of scientific studies. 
Often, sex categories are retained in data 
merely because they come tagged with sex, 
even if sex is not a central or relevant cat- 
egory of analysis. Frequently, researchers 
use sex as a proxy for other variables, ei- 
ther because other variables are more chal- 
lenging to collect or because sex is readily 
available in the data as a category of analy- 
sis. Rather than using crude sex categories 
as a proxy, scientists should use variables 
that are hypothesized as part of the direct 
mechanistic explanation for the condition 
of interest. This could mean, for exam- 
ple, reporting on hormone levels, weight, 
anatomy, presence or absence of gonads, 
or pregnancy status, instead of relying 
on male-female sex categories as a coarse 
means of capturing variability. 

Although it may seem self-evident that 
such practices are simply good science, 
they are actually rarely observed in stud- 
ies investigating and reporting sex differ- 
ences. For example, aiming to improve 
women’s health outcomes by correcting 
the historical record of androcentrism 
in biomedical research, in 2016 the US 
National Institutes of Health (NIH) intro- 
duced a Sex as a Biological Variable (SABV) 
funding mandate requiring preclinical bio- 
medical researchers working with cells, tis- 
sues, and model organisms to include male 
and female materials and to disaggregate 
data by sex. As analyses of the pronounced 
increase in studies of sex differences since 
the implementation of the mandate have 
shown, disaggregation by sex without 
proper statistical power and without an a 
priori reason in the study design contrib- 
utes to a muddle of contradictory results 
and false positives (13, 14). Given this, and 
the likelihood that a proliferation of scien- 
tific reports of sex-disaggregated findings 
may fuel sex essentialist law and policy, 
research funders and publishers should 
work to better align the imperative of con- 
ducting carefully contextualized studies 
of well-defined sex-related variables with 
SABV policies, including in journal publication
guidelines such as Sex and Gender
Equity in Research (SAGER) (12).
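
The statistical concern raised above can be illustrated with a short calculation. The sketch below is a simplified illustration, not a reconstruction of the cited analyses: it assumes independent tests, a two-sided z-test approximation for comparing two group means, and hypothetical effect sizes and sample sizes.

```python
import math

def familywise_false_positive_rate(alpha, n_tests):
    """Chance of at least one false positive across independent tests at level alpha."""
    return 1.0 - (1.0 - alpha) ** n_tests

def normal_cdf(x):
    """Cumulative distribution function of the standard normal."""
    return 0.5 * (1.0 + math.erf(x / math.sqrt(2.0)))

def two_group_power(effect_size, n_per_group, z_crit=1.959964):
    """Approximate power of a two-sided z-test comparing two group means.

    effect_size is the standardized mean difference; the small chance of
    rejecting in the wrong direction is ignored.
    """
    noncentrality = effect_size * math.sqrt(n_per_group / 2.0)
    return normal_cdf(noncentrality - z_crit)

# Hypothetical study: 20 animals per arm, then the same data split by sex.
pooled = two_group_power(0.8, 20)
one_sex = two_group_power(0.8, 10)
fwer = familywise_false_positive_rate(0.05, 10)

print(f"power, pooled analysis: {pooled:.2f}")
print(f"power, one sex analyzed alone: {one_sex:.2f}")
print(f"chance of a spurious hit across 10 tests: {fwer:.2f}")
```

Splitting a fixed sample by sex lowers the power of each comparison, while running many such comparisons raises the chance that at least one apparent "sex difference" is a false positive.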

Finally, investigators of sex-related vari- 
ables should strive to incorporate discus- 
sion of, and where possible, variables re- 
lated to gender (norms, beliefs, behaviors, 
and structures related to masculine and 
feminine gender roles and identities) and 
other social factors. For example, when 
studying sex disparities in disease, it is pru- 
dent to consider gendered behaviors and 
structural variables that contribute to risk 
of disease exposure, health care access, and 
likelihood of reporting certain symptoms. 
Understanding sex and gender in a bioso- 
cial framework (5) can help prevent unwar- 
ranted essentialist interpretations that pose 
sex and gender as simple binaries, immu- 
table and hardwired by biology. 


REFLECTIVE, ETHICAL, ACCOUNTABLE 

As the broad and dynamic field of legal and 
policy activity reviewed here demonstrates, 
practices in the science of sex differences 
matter because conceptions of binary sex 
as an uncontested, simple biological fact 
are presently playing a central role in anti- 
discrimination policy and human rights law 
critical to protections for women and sexual 
and gender minorities. When scientific uses 
of biological sex concepts lack clarity, preci- 
sion, and rigor, this increases the risk that 
legal advocates will misunderstand and 
misrepresent scientific research on biologi- 
cal sex. Reflective, ethical, and accountable 
practices surrounding how sex differences 
are discussed, contextualized, and applied 
in scientific studies are vital when science 
may be used to justify harmful and discrim- 
inatory policy. 

ANTHROPOCENE


Shaping Earth in our image 


An interdisciplinary interrogation of the Anthropocene 
misses the chance to probe broader and deeper 


By Erle C. Ellis and Mark Maslin


In 12 essays by 19 scholars, Altered Earth
seeks to set out the scientific evidence and 
the “human terms” that define the An- 
thropocene—the unofficial era of Earth’s 
history defined by human activities. 
With a foreword by esteemed historian 
Dipesh Chakrabarty, chapters by geologist 
Jan Zalasiewicz and Earth system scientist 
Will Steffen, and essays by vari- 
ous scholars across the humani- 
ties, this would seem the ideal 
text for those seeking to connect 
Anthropocene science with the 
humanities. If only it were so. 
The book employs a personal
essay format designed to engage
readers beyond the sciences.
There are some positive visions
of Anthropocene futures here,
and editor Julia Adeney Thomas’s
proposition that “stories...mold
the real world” is an inspiring
jumping-off point for a book in which “narratives
can emerge from conversations across
disciplines—geologists and Earth scientists,
on one hand, humanists and social scientists
on the other.” Chapters explore topics
as varied as disease (including COVID-19),
the colonial politics of wind power, humans
as political agents, the Chernobyl disaster,
an expedition to Mars, and a vision of cities
that “restabilize the Earth System.”

Altered Earth: Getting the Anthropocene Right
Julia Adeney Thomas, Ed.
Cambridge University

Despite a final chapter proposing that
“the Anthropocene calls for opening up
perspectives,” the book consistently defers
to a “growing consensus that our planet
entered a dangerous new state in the
mid-twentieth century.” There is no
doubt that a “Great Acceleration”
in environmental and social
trends occurred after World
War II or that many of these—including
greenhouse gas emissions—demand
serious attention
and action. But there is more
to the Anthropocene than the
events that have transpired since
the 1950s.

Earth scientists have variously
proposed Anthropocenes beginning
5000, 2000, 500, 200, and
70 years ago, defined by land clearing for 
agriculture, soil tillage, the European colo- 
nization of the Americas, the start of the 
Industrial Revolution, and nuclear fallout, 
respectively. Related terminologies, in- 
cluding “Plantationocene,” “Capitalocene,” 
“Technocene,” and even “Chthulucene,”
have emerged in the social sciences and 
humanities, as scholars seek to articulate 
the myriad factors and behaviors that are 
changing the planet. 


Having voted to exclude other evidence prior to the 
1950s, a working group may define the Anthropocene 
using radionuclide signatures from Crawford Lake. 


All this scholarly ferment should sur- 
prise no one. At stake is no less than what it 
means to change the planet—and even what 
it means to be human. Yet these varied pro- 
posals are treated here as diversions from a 
single, “correct” scientific definition. 

Perhaps the most remarkable thing about 
this book is what is left out. There are no 
archaeologists among the authors, and 
some of the most important works ever 
published on the subject are not cited. The 
early Anthropocene hypothesis, which pos- 
its that intense farming activities triggered 
considerable anthropogenic changes in 
Earth’s climate as long as 8000 years ago, 
goes unmentioned. Nor do any of the essays 
challenge how the Anthropocene might 
eventually be defined by scientists as an ep- 
och of geologic time, despite ongoing efforts 
to do so by several of the authors. 

One chapter explains the entangling of In- 
digenous history and sedimentary processes 
in Crawford Lake in southern Ontario and 
how a “golden spike” in those sediments due 
to radionuclide fallout from nuclear tests 
might come to mark the start of the Anthro- 
pocene in the year 1952. Less clear, however, 
is the fact that the working group responsible 
for epoch definition had already voted not to 
consider any earlier human evidence in these 
or any other sediments (1). 

More concerning is the book’s failure 
to explain why leading stratigraphers and 
environmental scientists are increasingly 
convinced that defining the Anthropocene 
as an epoch of geologic time—1950s or oth- 
erwise—is neither useful nor scientifically 
accurate (2). Dividing Earth’s history into 
two parts at a single point in time seems 
arbitrary to them, when it is so clear that 
human transformations of this planet rep- 
resent an ongoing and evolving geological 
event that has been spreading and scaling up 
for millennia. 

Where is the essay on defining the Anthro- 
pocene in relation to the wealthiest people 
on Earth—those most responsible for climate 
change? Where are the billion black Anthro- 
pocenes (3) and the communities who suf- 
fered as a result of European colonialism? 
What of the Indigenous peoples who have 
sustained Earth’s biodiversity for millennia? 
In Altered Earth, this fertile and important 
ground for discussion is left to others.

INNOVATION


The fastest R&D lab on the planet 


From car safety to road design, motorsport innovation has spilled over into other arenas 


By Lee Vinsel 


Although long ago established as a
serious and popular sport, automobile
racing, or motorsport, is not everyone’s
cup of tea. But in his new
book, Racing Green, journalist Kit
Chapman argues that we should all
be thinking about how racing has contrib- 
uted to society at large. 

The book’s title is something of a misno- 
mer, as only about a third of the volume is 
dedicated to environmental topics, with the 
remainder focusing on safety and speed. The 
main question that the book confronts— 
what are the sources of innovation?—has 
been a constant locus of attention in capital- 
ist nations since at least the 1950s. 

The vectors of new ideas are various. 
There are independent inventors, of course, 
and some corporations have research and 
development (R&D) labs. Universities also 
play a part, although not as big a part as 
they like to imagine. But in mature indus- 
tries, such as the automobile industry, in- 
novation is typically slow and incremental, 
focused on slightly improving established 
ideas, for instance, to better reach different
consumer markets. 

Chapman argues that things are different 
in motorsport, which he dubs “the world’s 
fastest R&D lab.” His story is one of what 
people in technology studies call “knowledge
spillover”—how new ideas spread from
one industry or sector to another. 

Unlike big automakers, Chapman con- 
tends, racing teams are constantly looking 
for technological breakthroughs that will 
give them advantages on the track. Motor- 
sport has also contributed to safety in vehi- 
cles, in road design, and in the operations of 
ambulances and emergency rooms. Perhaps 
more contentiously, Chapman argues that 
the sport could have big impacts on environ- 
mental improvements, including through 
battery technologies and artificial intelli- 
gence systems that maximize fuel efficiency. 

Although Chapman is prone to overstatement—“This
is a book about how racing
cars will save your life”—the book is full
of well-told, often humorous, surprising, 
and interesting stories of how innovations 
moved from the racetrack to other domains 
of human life. Readers learn how famous 
crashes, such as the one that resulted in 
the tragic death of Dale Earnhardt, led to 
changes in safety technology both on and 
off the track; the state of battery research
and how racing teams have pursued electric
power to gain split-second advantages
over competitors; and how the sport has experimented
and is currently experimenting
with previously unexplored materials such
as carbon, wood, and flax. Chapman also
does an excellent job conveying just how
far motorsport has come environmentally,
although he could have done more to highlight
how much energy these cars are using
for what some would no doubt consider
frivolous ends.

NASCAR driver Kyle Larson stands next to his wrecked vehicle after a crash in 2013.

At times, I wished Chapman had looked 
at motorsport in the larger context of technological
change. Knowledge spillover
is normal, and new ideas are constantly 
moving from one area of society to an- 
other. Chapman demonstrates that auto 
racing has spun out ideas that have had 
impacts elsewhere, but the spillover from 
motorsport pales in comparison with the 
innovation seeded by the US Department 
of Defense, which has created so many 
things, including GPS and the internet. 

The state is missing in other areas of 
Chapman’s picture, too. Although motor- 
sport may generate new ideas, the main- 
stream auto industry has adopted many 
safety and environmental technologies 
only by being forced to do so through 
regulation (see airbags and catalytic con- 
verters) or by being offered subsidies (see 
electric vehicles). Put another way, one 
is sometimes left wondering exactly how 
Chapman thinks innovations in motor- 
sport will travel elsewhere to do the impor- 
tant work he hopes they will do. 

Still and all, the book is an entertaining 
and often insightful read that will be enjoyed 
by both racing fans and people generally in- 
terested in processes of innovation. Its tales 
of spillover from motorsport to other do- 
mains are useful for further contemplation, 
and Racing Green would be an excellent resource
to engage students in deeper discussions
about technological change.

Standard housing for research animals does not provide enough space for healthy behaviors.


Edited by Jennifer Sills 


Good science requires 
better animal welfare 


Biomedical research suffers from a crisis of 
reproducibility and translatability (1), with
some studies estimating that more than 
50% of preclinical studies are irreproduc- 
ible (2). The lack of reproducible results 
has led to annual economic losses of about 
US$28 billion in the United States alone, 
delays to new treatments, and unneces- 
sary use of animals (2). To improve repro- 
ducibility and generate findings that are 
more generalizable to both sexes, the US 
National Institutes of Health (NIH) and 
the UK Medical Research Council (MRC) 
have each recently announced that they 
will only fund research that includes both 
male and female animals as default (3, 4). 
It is important that funders drive for bet- 
ter science, and inclusion of both sexes is 
to be welcomed given the heavy skew of 
biomedical research toward male subjects, 
which restricts diversity of research findings
(1). However, the new rules conflict
with efforts to minimize the use of animals 
in research and overlook the role of animal 
welfare in facilitating good science. 

The United Kingdom, the United States, 
and other countries have introduced plans 
for replacing, reducing, and refining the 
use of animals in research (known as 
the 3Rs) (5-7), based on long-established 
principles (8). The reduction goal refers to 
reducing sample sizes wherever possible.
If both male and female animals are used, 
as required by the new rule, sample sizes 
will likely need to be increased to prevent 
underpowered studies that fail to detect 
effects of treatments, sex, and their inter- 
actions. The MRC and NIH will have to 
provide clear guidance on how to accom- 
modate both objectives. 
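
The sample-size pressure described above can be made concrete with a standard-error comparison. This is a minimal sketch assuming a balanced two-by-two design (treatment by sex), independent animals, and a common standard deviation; the function names and numbers are illustrative and do not come from NIH or MRC guidance.

```python
import math

def se_main_effect(sigma, n_total):
    """SE of the treated-minus-control difference, pooling both sexes
    (n_total / 2 animals per arm)."""
    return sigma * math.sqrt(2.0 / (n_total / 2.0))

def se_interaction(sigma, n_total):
    """SE of the sex-by-treatment interaction (a difference of differences),
    with n_total / 4 animals in each of the four cells."""
    return sigma * math.sqrt(4.0 / (n_total / 4.0))

# Hypothetical study of 40 animals with unit standard deviation.
ratio = se_interaction(1.0, 40) / se_main_effect(1.0, 40)
print(f"interaction SE relative to main-effect SE: {ratio:.1f}x")
```

Under these assumptions, estimating an interaction as precisely as the pooled treatment effect takes about four times as many animals, which is where the requirement to include both sexes meets the reduction goal.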

Especially given the need for larger sam- 
ple sizes, it is crucial to ensure good animal 
welfare (consistent with 3Rs refinement 
goals) so as not to compromise results. It 
is widely argued that good animal welfare 
equals good science, but most funders 
have stalled in implementing better animal 
welfare standards. The standard housing 
used by most labs for animals, which is in 
line with guidelines and codes of practice 
(9, 10), includes confined spaces that don’t 
allow enough space to move normally, 
a lack of shelter from bright lights, and 
insufficient stimulation. These conditions 
lead to stress and poor health, which can 
affect both behavior and physiology (11). 
Animals in confined spaces are unable to 
perform highly motivated behaviors such 
as rearing and exploring, exhibit behav- 
ioral signs of depression such as anhedo- 
nia, show exacerbated severity of induced 
diseases such as cancer, and have increased 
mortality rates (11). To improve reproduc- 
ibility (and live up to the aspirations of the 
3Rs), the requirement to include both male 
and female animals must be accompanied 
by a requirement for substantial and long-
awaited improvements to the housing 
of laboratory animals (12), including 
increased space, increased environmental 
complexity, and increased opportunities to 
engage in normal behaviors such as
digging and nesting. 


Indigenous lands protect 
Brazil’s agribusiness 


Since taking office as Brazil’s president 
in 2019, Jair Bolsonaro has greatly
weakened the protection of the Amazon and 
its Indigenous peoples (1). Bolsonaro’s 
hate speech against Indigenous peoples 
and policies dismantling the protection of 
Indigenous lands have led to an increase 
in invasions and attacks on Indigenous, 
public, and conservation lands (2), which 
his administration has rewarded rather 
than deterred (2, 3). Proposed bill PL 
490/2007 (4) would continue this trend by 
blocking or reversing the recognition of 
the rights of many Indigenous peoples to 
their traditional lands. 

Brazilian law guarantees that Indigenous 
residents have exclusive rights to their 
Indigenous lands, protecting them from 
non-Indigenous individuals and companies 
(5). However, the protected status requires 
approval through a bureaucratic process 
that can take decades. There are currently 
303 Indigenous lands that are in the pro- 
cess of obtaining protection. 

PL 490/2007 would impede the final 
approval of the Indigenous lands currently 
working toward protection and allow 
revocation of the protected status enjoyed 
by many lands that were approved after 5 
October 1988, the date of Brazil’s current 
constitution (6). Those that reached the 
final approval stage before 1988 will not be 
affected by PL 490/2007, although they con- 
tinue to face threats from illegal invasions. 

The bill, which has been stalled in the 
National Congress since 2007, is now rac- 
ing toward final approval. It has passed 
the lower house of Congress and is being 
given priority in the Senate. A case before 
the Supreme Federal Court will decide on 
its constitutionality in the coming weeks. 
On 25 April, President Bolsonaro stated 
that if the court rejects the policies out- 
lined in the bill, he will likely refuse to 
comply with the decision (7). 

Agribusiness representatives justify 
their support for the bill with the mis- 
leading argument that the preservation 
of Indigenous lands would negatively 
affect Brazil’s agricultural production (7). 
However, the amount of idle deforested 
land in Brazil is more than sufficient to 
ensure the projected growth of Brazilian 
agribusiness (8). Indigenous lands protect 
25% of Brazil’s Amazon biome, where 
deforestation is now close to the limit that 
the forest can tolerate (9). Brazil is already 
facing a severe water crisis that has 
affected both agribusiness and the water 
supply for domestic use (10). Further 
deforestation in the Amazon would 
worsen this crisis (11, 12), jeopardizing the 
country’s agribusiness. 

It is crucial that the Supreme Federal 
Court reject the policies proposed by PL 
490/2007. If the protection of Indigenous 
lands is withheld or revoked, countries 
that import soy and beef from Brazil must 
boycott these commodities to prevent 
increased deforestation in the Amazon 
and violation of Indigenous rights. 


Irrawaddy dolphins 
continue to decline 


The Indo-West Pacific Irrawaddy dolphin 
(Orcaella brevirostris) inhabits coastal 
areas and rivers in South and Southeast 
Asia. The global population, listed as 
Critically Endangered by the International 
Union for Conservation of Nature, 
has declined by 50% over the past two 
decades (1) and recently became locally 
extinct in Vietnam’s Sekong, Sesan, and 
Srepok Rivers (1). In February, the last 
Irrawaddy dolphin in the Mekong River 
in Laos was found dead, entangled in gill- 
nets (2, 3). The remaining Mekong River 
population—about 90 individuals—spans 
only 180 km in Cambodia (1). Irrawaddy 
dolphins in the Mekong, along with 
those in Indonesia’s Mahakam River and 
Myanmar’s Ayeyarwady River, are threat- 
ened by anthropogenic activities that put 
the species at risk of imminent regional 
extinction (1, 4). Cambodia, Indonesia, 
and Myanmar must therefore act quickly 
to conserve the species. 

Gillnet fishing and dam building are 
among the main threats to these river dol- 
phin subpopulations. Gillnets, which are 
difficult to avoid, can trap and drown the 
dolphins. The dams divide the remaining 
individuals into smaller, isolated popula- 
tions (5). This habitat fragmentation is 
stressful to the species and reduces genetic 
variation, threatening the Mekong River 
subpopulation with a demographic collapse 
that could lead to extinction (6). 

The extinction of local freshwater popu- 
lations limits the conservation strategies 
available to protect the remaining popula- 
tions (7, 8). Therefore, the governments 
of Cambodia, Indonesia, and Myanmar 
must act immediately to implement the 
UN Environment Programme’s Convention 
on Migratory Species Action Plan for the 
Conservation of Freshwater Populations of 
Irrawaddy Dolphins to designate protected 
areas where gillnet fishing is banned (9). 
Assessments of the species’ status should 
use satellite telemetry to map its home 
range. Local fishermen outside designated 
refuges in all three river subpopulations 
should use acoustical deterrents and
reflective nets to reduce dolphin bycatch (10, 11). 


Proteins adrift in a sea of mutations 


In two related proteins, one might expect the same mutation to have the same effect, but 
whether this is true greatly depends on context. Looking at nine ancestral reconstructions 
of a DNA-binding domain, Park et al. evaluated the effects of 275 substitutions at predicted 
evolutionary steps. The effects of mutations were highly contingent on a dense network of 
epistatic interactions. There were mutations with both short and long memory length, but 
the average rate of epistatic change was generally constant. The landscape of potential and 
actual interactions is thus constantly shifting at a rate that can be known, but the direction of 
the change is contingent on evolutionary history and becomes unpredictable after a time. 
—MAF 

Science, abn6895, this issue p. 823 


Artist’s representation of the changing effects of mutations during long-term protein evolution 


Changing the balance of 
orders with light 


Charge-density wave (CDW) 
order, the spatial modulation 
of electronic density, is known 
to compete with superconductivity
in copper oxide superconductors.
This understanding comes largely from 
equilibrium experiments, and 
the dynamics of the interplay of 
these orders are less explored. 
Wandel et al. used ultrafast 
laser pulses to quickly quench 
superconductivity in samples of 
the YBa2Cu3O6+x superconductor, 
which temporarily enhanced 
the CDW order. The process 
increased the correlation length 
of the CDW order, suggesting 
that superconductivity stabilizes 
CDW defects that are removed 
by suppressing superconductiv- 
ity. —JS 

Science, abd7213, this issue p. 860 


Extracellular processing 
raises alarm 


Plants that lack defensive 
responses to given microbial 
pathogens can fail miserably, as 
the potato plant did when faced 
with the oomycete pathogen 
Phytophthora infestans, the 
cause of the potato blight that 
contributed to famines in the 
mid-19th century. Kato et al. show 
how the small mustard plant 
Arabidopsis defends against this 
same oomycete. A sphingolipid 
from the oomycete’s cell mem- 
brane is cleaved by a ceramidase 
that the plant puts out into the 
apoplast. One of the cleavage 
products is a branched sphingoid 
base that is then recognized by 
a lectin receptor-like kinase on 
the plant cell surface, kicking off 
the plant's immune responses. 
Without the apoplastic cerami- 
dase or the cell surface lectin, the 
plant's ability to defend against 
this oomycete pathogen is com- 
promised. —PJH 

Science, abn0650, this issue p. 857 


Modifying 
mechanotransduction 


Split thickness skin grafting 
(STSG) is used to treat deep 
burns and other traumatic 
skin injuries, but the method is 
complicated by hypertrophic 
scar formation, contractures, and 
potential loss of biomechanical 
function. Chen et al. developed 
a porcine model of STSG and, 
through single-cell RNA sequenc- 
ing, identified up-regulation of 
mechanotransduction signaling 
pathways in the healing grafts. 
Applying a hydrogel containing 
a focal adhesion kinase (FAK) 
inhibitor to the grafts to disrupt 
mechanotransduction improved 
healing and reduced contracture 
and scar formation, with anti- 
inflammatory effects in the acute 
setting and pro-regenerative 
effects at later time points. These 
findings suggest that FAK inhibi- 
tion could be beneficial for the 
treatment of injuries requiring 
STSG. —MN 

Sci. Transl. Med. 14, eabj9152 (2022). 


CELL DEATH 
A basic way to activate 
necroptosis 
Cell death by necroptosis 
canonically involves activa- 
tion of the kinase RIPK3 by the 
upstream kinase RIPK1, leading 
to plasma membrane rupture 
by the RIPK3 effector MLKL. 
Zhang et al. found that osmotic 
stress directly activated RIPK3 
and stimulated necroptosis in 
a manner that required MLKL 
but not RIPK1. Osmotic stress 
increased intracellular pH 
through the Na+/H+ exchanger 
SLC9A1, and RIPK3 was activated
in cells with intracellular 
pH set at basic, nonphysiologi- 
cal values. Thus, induction of 
necroptosis by osmotic stress 
bypasses the activation of 
RIPK3 by RIPK1. -WW 

Sci. Signal. 15, eabn5881 (2022). 


KINETOCHORE 


A tight grip 

Kinetochores are large, multi- 
protein complexes responsible 
for the attachment of chromo- 
somes to the mitotic spindle, 
helping cells to segregate their 
DNA into two daughter cells. 
However, how kinetochores 
establish a robust attach- 
ment between chromosomes 
and the mitotic spindle has 
been unclear. Yatskevich et 
al. determined a cryo-electron
microscopy structure of 
the human inner kinetochore 
bound to centromeric chromatin,
revealing that these large 
macromolecular machines 
tightly grip and encircle the 
linker DNA emerging from the 
specialized centromeric CENP-A 
nucleosome. This study provides
molecular details of how 
kinetochores can form robust, 
load-bearing attachments that 
can withstand chromosome 
congression and segregation 
forces. —DJ 

Science, abn3810, this issue p. 844 


Cryo-electron microscopy structure 
revealing how kinetochores 
interact with DNA and nucleosomes 
in the centromere 


GEOPHYSICS 
Relying on repeaters 


Underwater optical cables can 
be used to monitor seismic 
disturbances and ocean cur- 
rents, but the signal tends to 
be integrated over the entire 
length of the cable, which can 
be thousands of kilometers 
long. Marra et al. were able to 
isolate individual segments of a 
5800-kilometer-long cable for 
seafloor monitoring. Because 
undersea cables have repeat- 
ers every 90 kilometers, these 
segments could each be used 
as vibrational sensors when 
coupled with a laser source. This 
approach allowed the authors to 
better constrain the location of 
an earthquake through triangu- 
lation, thus offering a method for 
much better spatial resolution 
for undersea monitoring. —BG 
Science, abo1939, this issue p. 874 
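
A back-of-the-envelope sketch of why working span by span sharpens resolution, using only the two figures quoted above (a 5800-kilometer cable with repeaters every 90 kilometers). The function names are hypothetical, and the sketch does not model the optical measurement or the triangulation itself; it only shows how a disturbance can be bounded to one repeater-to-repeater span rather than to the whole cable.

```python
from math import ceil

CABLE_LENGTH_KM = 5800    # figure quoted in the summary
REPEATER_SPACING_KM = 90  # figure quoted in the summary


def span_count(cable_km, spacing_km):
    """Number of repeater-to-repeater spans along the cable."""
    return ceil(cable_km / spacing_km)


def locate_span(distance_km, cable_km, spacing_km):
    """Return (index, start_km, end_km) of the span containing a point."""
    if not 0 <= distance_km <= cable_km:
        raise ValueError("point lies outside the cable")
    # Clamp so a point exactly at the far end falls in the last span.
    index = min(int(distance_km // spacing_km), span_count(cable_km, spacing_km) - 1)
    start = index * spacing_km
    end = min(start + spacing_km, cable_km)  # the last span may be shorter
    return index, start, end


if __name__ == "__main__":
    spans = span_count(CABLE_LENGTH_KM, REPEATER_SPACING_KM)
    print(f"{spans} spans instead of one {CABLE_LENGTH_KM}-km integrated path")
    # Hypothetical disturbance 1234 km from the landing station.
    print(locate_span(1234, CABLE_LENGTH_KM, REPEATER_SPACING_KM))
```

With these numbers, the uncertainty along the cable shrinks from thousands of kilometers to at most one 90-kilometer span.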


CELL BIOLOGY 
Splitting cell sisters 


During cell division, sister 
cells generally break apart 
so that they can function as 
separate entities. However, 
some specialized cells avoid 
the abscission step and remain 
intimately connected through 
cytoplasmic bridges, as seen 
with germline cysts. Mathieu 
et al. found that mutating the 
Usp8 gene was sufficient to 
transform incomplete divisions 
into complete divisions, and, 
conversely, overexpressing 
USP8 in Drosophila germline 
stem cells was sufficient to cre- 
ate ectopic cytoplasmic bridges 
with their daughter cells. USP8 
participated by deubiquitinat- 
ing target ESCRT-III proteins to 
switch between incomplete and 
complete cell divisions. —BAP 
Science, abg2653, this issue p. 818 


Feedback shapes cochlear activity 


The dorsal cochlear nucleus (DCN) of the brainstem is a crucial 
point in the auditory pathway where sensory sharpening, modula- 
tion, and gating take place. Descending inputs from higher-order 
areas of the auditory system project back to auditory pathway 
nodes, potentially providing attentional and contextual informa- 
tion. The inferior colliculus (IC) in particular sends signals back to the 
DCN. Balmer and Trussell identified the cell types across the entire DCN 
that receive input from IC and tested their postsynaptic responses. 
Granule cells, unipolar brush cells, and Golgi cells received direct, glu- 
tamatergic input. Numerous other cell types also processed this input, 
suggesting that the IC is a strong source of top-down control of the first 
auditory processing region in the brain that may sharpen tuning and 
improve auditory feature detection. —PRS J. Neurosci. 42, 3381 (2022). 


The inferior colliculus region in the brain contains cell types 
involved in feedback for fine-tuning hearing. 


CELL BIOLOGY 
Lipid-driven 
lysosome repair 

Damaged lysosomes threaten 
cell viability if their contents 
leak into the cytosol. ESCRT 
machinery can repair damaged 
lysosomes, which are par- 
ticularly vulnerable to injuries. 
Niekamp et al. uncovered an 
ESCRT-independent, sphingomyelin
(SM)-based lysosomal 
repair pathway. Perturbations in 
lysosomal or plasma membrane 
integrity were tightly coupled 
to a rapid, calcium-activated 
scrambling of SM across 
the bilayer. Live-cell imaging 
revealed a critical role for SM in 
the recovery of lysosomes from 
acute, potentially lethal dam- 
age. SM was not required for 
ESCRT recruitment to damaged 
lysosomes. Instead, conversion 
of SM by neutral SMases on 
the cytosolic surface of injured 
lysosomes promoted their 
repair. Thus, calcium-activated 
SM scrambling and turnover 
can clear minor lesions from the 
lysosome-limiting membrane 
and prevent lysosome damage- 
induced cell death. -SMH 


Nat. Commun. 13, 1875 (2022). 


Cleaning up mitochondria 
Although most genetic diseases
in humans are caused 
by alterations in nuclear DNA, 
some are caused by mutations
in mitochondrial DNA. 
Because the mitochondrial genome 
is inherited directly from the 
mother, assisted reproductive 
technology to help patients with 
mitochondrial genetic disorders 
involves transferring the nucleus 
from the mother’s oocyte into 
a donor oocyte with healthy 
mitochondria. Unfortunately, the 
nucleus transferred through this 
procedure generally includes 
some of the patient's mitochondria,
which carry the disease 
mutation and can gradually 
proliferate and cause disease 
recurrence in subsequent generations.
A method developed 
by Fan et al. involves tagging 
the patient’s mitochondria with 
an autophagy receptor before 
nuclear transfer, forcing the 
resulting oocytes to selectively 
destroy the mitochondria 
carrying the disease-causing 
mutation. —YN 

Nat. Biomed. Eng. 6, 339 (2022). 


Snapshots of cofactor 
chemistry 


One of the key functions of 
enzymes and their associated 
cofactors is controlling how 
electrons and protons are deliv- 
ered during oxidation-reduction 
reactions. Maestre-Reyna et al. 
used serial x-ray crystallography 
to investigate light-driven elec- 
tron- and proton-transfer steps 
in a flavoenzyme involved in DNA 
repair and observed structural 
changes in the flavin cofactor 
after light-induced electron 
transfer. A redox-sensitive triad 
of residues facilitated proton 
transfer from a proximal arginine 
residue to the single-electron 
reduced flavin. These snapshots 
reveal an uncommon mechanism
of photoreduction and set 
the stage for understanding the 
photolyase chemistry that these 
enzymes perform. —MAF 

Nat. Chem. 10.1038/s41557-022-00922-3 (2022). 


Letting the needle drop 


As environmental stresses such as drought 
become more frequent, physiological and 
morphological traits can provide clues 
as to how different species will respond. 
Song et al. examined variation in leaf and 
stem traits among 28 conifer species (grown 
in a common garden experiment) to identify 
how they relate to trees’ drought resilience. 
The authors found a trade-off in stem traits 
between hydraulic efficiency and wood cavitation
resistance (a “hydraulics spectrum”) 
and a continuum in leaf traits from tough and 
long-lived to acquisitive (a leaf economics 
spectrum). Species from warmer climates had 
more acquisitive traits and greater resilience to 
drought. Long-lived leaves (needles or scales) 
signaled lower drought recovery because these 
species were less able to grow new leaves and 
adjust to changing conditions. —BEL 

New Phytol. 10.1111/nph.18177 (2022). 


Drought is particularly harmful to trees with long-lived 
leaves, such as these spruce trees, because these species 
have poor capacity for leaf regrowth. 


Searching for more 
Herbig stars 


Herbig stars are objects 
between two and eight solar 
masses that are in the final 
stages of formation by accret- 
ing material shortly before 
reaching the stellar main 
sequence. Vioque et al. used 
optical spectroscopy to identify 
128 new Herbig stars, increas- 
ing the total number known by 
more than 50%. By studying the 
emission lines in this expanded 
sample, they identified a change 
in the accretion properties at 
around four solar masses. This 
might be due to a change in the 
accretion mechanism, but other 
explanations are possible. The 
expanded sample of spectroscopically
observed Herbig 
stars will aid studies of this 
short-lived and poorly understood
phase of stellar evolution. 
—KTS 

Astrophys. J. 10.3847/1538-4357/ac5c46 (2022). 


Pull-apart clusters 
Icosahedral carboranes are 
molecular polyhedra contain- 
ing two carbon atoms and 10 
boron atoms, each capped by 
hydrogen. For the most part, 
their rigid stability has taken 
center stage in applications. 
Sha et al. report that the ortho 
isomer of this class of clusters 
is a mechanophore: Embedded 
in a methyl acrylate polymer and 
exposed to shear forces during 
sonication, the cage breaks 
apart. Nuclear magnetic reso- 
nance and trapping studies are 
consistent with homolytic bond 
cleavage. By contrast, the meta 
isomer remains stable in similar 
circumstances. —JSY 

Angew. Chem. Int. Ed. 10.1002/anie.202203169 (2022). 


MICROTUBULES 
Trimming the cell’s 
skeleton 


An essential part of a cell’s 
backbone is a dynamic network 
of microtubules that helps 
them move, keep their shape, 
and transport cargo between 
different compartments. 
To achieve different cellular 
functions, microtubules are 
flagged by posttranslational 
modifications. Detyrosination 
of α-tubulin is such an alteration,
during which α-tubulin’s 
terminal tyrosine is cleaved 
off. Abnormal detyrosination 
levels are linked to pathological
conditions. Landskron et al. 
describe the enzyme MATCAP as 
a microtubule-binding protease 
that can detyrosinate α-tubulin. 
MATCAP, together with the 
previously identified detyrosinat- 
ing vasohibins, is important for 
normal brain development and 
function. —SMH 

Science, abn6020, this issue p. 815 


TOPOLOGICAL MATTER 


Ubiquitous topology 
Topologically nontrivial materi- 
als were once thought to be the 
exception rather than the rule. 
More recently, high-throughput 
computations have shown that 
many more materials are topo- 
logical than was initially thought. 
Vergniory et al. expanded this 
approach to include all process- 
able entries in the Inorganic 
Crystal Structure Database 
and studied the materials’ band 
structure both at and away from 
Fermi energy. The researchers 
found that almost 88% of these 
materials had at least one topo- 
logical band. —JS 

Science, abg9094, this issue p. 816 


NEURODEVELOPMENT 
Competitive migration 
steers interneurons 


During brain development, both 
interneurons and oligoden- 
drocyte precursor cells travel 


SCIENCE science.org 


tangentially from their birth- 
place to the location where they 
will function. Lepiemme et al. 
looked for interactions during 
the migratory phase of mouse 
brain development. Although 
born in the same regions and 
destined to be intertwined in 
the cortex, interneurons and 
oligodendrocyte precursor 
cells differed in how they got 
there. Oligodendrocyte precur- 
sor cells tended to track along 
blood vessels, whereas inter- 
neurons migrated in collectively 
organized streams. Imaging 
showed that oligodendrocyte 
precursor cells drove migrat- 
ing interneurons away from the 
blood vessels. Left to fend for 
themselves, the interneurons 
organized in migratory streams. 
—PJH 

Science, abn6204, this issue p. 817 


NEUROSCIENCE 
Is the internet addictive? 


There is increasing awareness 
that certain applications on 
the internet can elicit addic- 
tive behaviors in some people. 
Problematic gaming and gam- 
bling are now recognized as 
disorders, but the same may 
also apply to shopping, social 
network usage, and pornog- 
raphy. In a Perspective, Brand 
discusses whether there are 
parallels between addictive 
internet usage and the path 
to substance addiction. By 
examining the neurobiological 
and psychological mechanisms 
that lead to addiction to internet 
applications, it may be possible 
to understand what makes 
some people vulnerable to 
addiction, what the causes are, 
and whether there are specific 
mechanisms involved in internet 
addiction that are different from 
other addictions. Given the cur- 
rent climate of social distancing 
and the attendant increased 
internet usage, it is important 
to understand how problematic 
internet addiction can be pre- 
vented. —GKA 

Science, abn4189, this issue p. 798 


STRUCTURAL BIOLOGY 
Big complexes from 
single-celled eukaryote 


Mitochondria use a series of 
membrane-embedded enzymes 
to transform chemical energy 
first into an electrochemical 
gradient and then into ATP. The 
series of chemical steps that 
take place are conserved in 
aerobic organisms, but there 
is considerable structural 
divergence in different lineages. 
Working with extracted mito- 
chondrial membranes, Zhou 
et al. determined structures of 
mitochondrial electron transfer 
chain complexes from a model 
ciliate (see the Perspective by 
Huynen and Elurbe). Complex I 
and a dimer of complex III form 
a supercomplex as in many 
other eukaryotes, but both 
have important fundamental 
differences in architecture and 
functional apparatus. Complex 
IV is massive relative to previ- 
ously known structures, with 
more than half of the mass 
composed of components not 
seen in homologous structures, 
including mitochondrial carrier 
proteins and a translocase that 
may be involved in assembly or 
may be a relic of such compo- 
nents. —MAF 

Science, abn7747, this issue p. 831; 
see also abq0368, p. 794 


FOREST ECOLOGY 
Benefits of forest 
restoration 


Reforestation is promoted 
globally as one way to mitigate 
climate change through the stor- 
age of carbon in woody growth 
and ecosystem services such 
as control of soil erosion and 
management of water supplies. 
Hua et al. assessed the relative 
performance of plantation and 
native forest in achieving these 
goals (see the Perspective by 
Gurevitch). Synthesizing data 
from the world's major forest
biomes, the authors found 
that native forests consistently 
delivered better performance 
than plantations in the provision 
of the three major ecosystem 
services, with additional benefits 
for biodiversity. The discrep- 
ancy was particularly evident in 
warmer and drier regions. These 
findings show that the benefits 
of reforestation will be best 
achieved through the restora- 
tion of native forests rather than 
extensive plantation programs. 
—AMS 

Science, abl4649, this issue p. 839; 

see also abp8463, p. 792 


BIOCATALYSIS 
Adapting iron enzymes 
for azidation 


Nonheme iron enzymes typically 
activate molecular oxygen to 
initiate oxidation of a substrate 
or atom transfer through radical 
mechanisms. Rui et al. used 
directed evolution to encourage
a nonheme iron oxidase 
to perform enantioselective 
azide transfer using an aryl 
N-fluoroamide as both sub- 
strate and radical initiator. The 
resulting azides, installed on an 
ortho alkyl group, may be useful 
starting points for synthesis 
or handles for bioorthogonal 
coupling. —MAF 

Science, abj2830, this issue p. 869. 


FOSSIL RECORD 


Ghosts of the past 


The marine geological records 
of some past global warming 
events contain relatively few 
nannoplankton fossils, the lack 
of which some interpret as being 
evidence of the impact of ocean 
acidification and/or related 
environmental factors on biocalcification.
Slater et al. present 
a global record of imprint, or 
“ghost,” nannofossils throughout 
several of those intervals during 
the Jurassic and Cretaceous 
periods (see the Perspective 
by Henderiks). This finding 
implies that a literal interpretation
of the fossil record can be 
misleading, and demonstrates 
that nannoplankton were more 
resilient to past warming events 
than traditional fossil evidence 
would suggest. —HJS 

Science, abm7330, this issue p. 853; 
see also abp9754, p. 795 


IMMUNOLOGY 
How T cells respond 
to antigens 


Although they are important 
tools in immunotherapy, we still 
have an incomplete understand- 
ing of how T cells respond to 
antigens in different temporal 
and molecular contexts. Achar 
et al. combined robotic sam- 
pling of mouse and human T 
cell responses over time with 
machine learning and modeling 
to gain insights into how T cells 
process information derived 
from the particular antigens 
to which they are exposed 
(see the Perspective by 
Nourmohammad). The analysis 
showed that patterns of cytokine 
release carried information 
about the type of antigen 
encountered and distinguished 
six distinct cellular responses 
rather than the three types 
usually recognized. Such under- 
standing is likely to enhance 
strategies for immunotherapies 
that rely on altered and designed 
T cell responses. —LBR 

Science, abl5311, this issue p. 880; 

see also abq1679, p. 796 


FORESTRY 
Diversity boosts 
plantation biomass 


Across experimental and natural 
systems, more diverse plant 
communities often have higher 
primary productivity. This effect 
can be caused by complemen- 
tarity between different species, 
which can more effectively 
use resources together, or by 
a higher likelihood of more 
productive species being 
present. Feng et al. used data 
from 255 sites to test whether 
forest plantations with multiple 
species have greater produc- 
tivity than monocultures (see 
the Perspective by Gurevitch). 
They found that multispecies 


plantings, on average, had taller 
and thicker trees and greater 
aboveground biomass accu- 
mulation than monocultures. 
Complementarity between species
was the primary driver of 
this effect, with greatest benefits 
from pairing species with differ- 
ent traits. —BEL 

Science, abm6363, this issue p. 865; 
see also abp8463, p. 792 


MUCOSAL IMMUNOLOGY 


BMP puts the brakes on 
tuft cells 


Intestinal parasite infections 
or allergic reactions promote 
interleukin-13 (IL-13)-induced 
differentiation of tuft cells as one 
manifestation of type 2 immunity
in the gut. Using organoid 
cultures of intestinal epithelial 
cells, Lindholm et al. investigated 
how the lymphocyte cytokines 
IL-13, IL-22, and interferon-γ 
regulate the signaling pathways 
that influence epithelial differentiation.
Although tuft cell 
IL-25 promoted the expansion 
of IL-13-producing type 2 innate 
lymphoid cells in a feedforward 
loop, the resulting IL-13 also 
induced ligands of the bone 
morphogenetic protein (BMP) 
signaling pathway. BMP agonists 
acted on stem cells to prevent 
runaway tuft cell expansion by 
limiting the expression of Sox4, 
a transcription factor required 
for tuft cell differentiation. These 
findings provide new molecular 
insights into how intestinal epi- 
thelial differentiation is carefully 
choreographed in response to a 
diverse array of cytokine signals. 
—IRW 

Sci. Immunol. 7, eabl6543 (2022). 


LIFE SCIENCES 
Nano-cocoons for 
targeted RNA editing 


Within the collection of CRISPR-Cas
enzymes, Cas13a is an 
RNA-guided, RNA-targeting 
CRISPR effector that is able 
to recognize and cleave target 
RNA to regulate gene expression
at the transcriptional 
level. However, Cas13a can 
induce collateral cleavage of 
nontargeted RNAs, suggesting
the need for systems to 
target activity of Cas13a for 
use in therapeutic applications. 
Fan et al. designed targeted, 
multifunctional “nano-cocoons” 
that delivered Cas13a and its 
guide sequence within pH-responsive
nanocapsules. Upon 
release and activation, Cas13a 
cleaved RNA nanosponges that 
were also contained within the 
nano-cocoons to release chemotherapy
entrapped within the 
nanosponges. These hierarchical
nano-cocoons sequentially 
released an RNA-editing system 
and chemotherapy, which 
together inhibited glioblastoma 
cell proliferation in vitro and in 
vivo. —AKP 

Sci. Adv. 10.1126/sciadv.abn7382 (2022). 

814-C 20 MAY 2022 • VOL 376 ISSUE 6595 


MICROTUBULES 


Posttranslational modification of microtubules 
by the MATCAP detyrosinase 


Lisa Landskron, Jitske Bak, Athanassios Adamopoulos, Konstantina Kaplani, Maria Moraiti, 
Lisa G. van den Hengel, Ji-Ying Song, Onno B. Bleijerveld, Joppe Nieuwenhuis, Tatjana Heidebrecht, 
Linda Henneman, Marie-Jo Moutin, Marin Barisic, Stavros Taraviras, 
Anastassis Perrakis*, Thijn R. Brummelkamp* 


INTRODUCTION: Microtubules are an essential 
multifunctional component of the eukaryotic 
cytoskeleton and control cell division, shape, 
and migration by generating force and regu- 
lating trafficking of cellular cargo. Although 
all microtubules are built of α- and β-tubulin 
dimers, they differ in their chemical composition
owing to the existence of multiple 
tubulin isotypes and posttranslational modifications.
An abundant modification is the removal
of the C-terminal tyrosine of α-tubulin, 
called detyrosination. This occurs mostly on 
stable microtubules and can be reversed by 
the tubulin tyrosine ligase (TTL), which acts 
on disassembled tubulin dimers. Aberrant 
detyrosination is linked to mitotic errors, car- 
diac failure, and neurological defects, whereas 
retyrosination is essential for life. Although de- 
scribed decades ago, our understanding of the 
mechanism and relevance of tubulin detyrosi- 
nation remains incomplete. 


RATIONALE: Recently, vasohibins (VASH1 and 
VASH2), in complex with their cofactor small 
vasohibin-binding protein (SVBP), have been
identified as detyrosinating enzymes. However, 
inactivation of vasohibins does not eliminate 
detyrosination in most cell types, suggesting 
the presence of one or more additional de- 
tyrosinating enzymes. Whereas other tubulin 
modifications are carried out by large protein 
families of multiple related enzymes, there are 
no proteases that closely resemble vasohibins. 
To identify the remaining detyrosinating en- 
zyme(s), we ablated VASH1 and VASH2 in 
haploid human HAP1 cells and used genome-wide
mutagenesis using tubulin tyrosination
and detyrosination as readouts. The sensi- 
tivity of these genetic screens was enhanced by 
treatment with paclitaxel or removal of TTL, 
either to boost tubulin detyrosination or to 
make tyrosine cleavage irreversible. 


RESULTS: The genetic screens identified 
KIAA0895L, an unstudied protein without
apparent protease features, as a remaining 
detyrosinase. Removal of KIAA0895L in com- 
bination with VASH1 and VASH2 resulted in 


Tubulin detyrosination is carried out by the distinct MATCAP and vasohibin detyrosinases. (A) The
cytoskeleton of (U2OS) cells consists of detyrosinated and tyrosinated microtubules. (B) MATCAP (blue) and
VASH1-SVBP (orange and green) detyrosinate microtubules. [Drawn in Blender using the macro Geometry Nodes
Microtubule] (C) Impaired detyrosination leads to tyrosinated microtubules and a reduction in brain volume.


Illustration was created with BioRender.com. 
undetectable levels of tubulin detyrosination in
cultured cells. A crystal structure of KIAA0895L
revealed an atypical metalloprotease fold 
that belongs to the gluzincins clan [includ- 
ing, for example, the severe acute respiratory 
syndrome coronavirus 2 (SARS-CoV-2) recep- 
tor angiotensin-converting enzyme 2 (ACE2)]. 
KIAA0895L had escaped annotation as a pro- 
tease owing to a degenerate signature motif 
(HExxxH instead of HExxH) that interrupts 
the active site-containing helix without subs- 
tantially affecting the integrity of the catalytic 
site arrangement. We renamed the protein 
MATCAP (for microtubule-associated tyro- 
sine carboxypeptidase). MATCAP and the 
vasohibins have adopted fundamentally differ- 
ent catalytic strategies (metalloprotease versus 
cysteine protease) to target the α-tubulin tail.
In addition, whereas vasohibin cleavage relies
on the second-last glutamate of the EEY terminal
motif and only tolerates the chemically
related phenylalanine in place of the α-tubulin
terminal tyrosine, MATCAP is promiscuous as
to the nature of the terminal residue and shows
a dependence on α-tubulin tail residues also
located further upstream for substrate recognition.
A cryo-EM structure of catalytically
inactive MATCAP bound to fully tyrosinated 
microtubules indicated that MATCAP recog- 
nizes tubulin dimers in the same protofila- 
ment, whereas the VASH1-SVBP complex sits 
in between neighboring microtubule proto- 
filaments. Unexpectedly, genetic ablation of 
MATCAP together with SVBP in mice led to 
viable offspring with undetectable levels of 
detyrosination. Despite their viability, these 
mice displayed morphological brain defects 
accompanied by proliferation defects during 
neurogenesis and displayed abnormal behavior. 


CONCLUSION: Our work shows that tubulin 
detyrosination in mammals involves two
evolutionarily distinct enzyme families: the
vasohibin cysteine proteases in combination with
their peptide cofactor SVBP and the atypical 
metalloprotease MATCAP identified here. The 
distinct evolutionary origin and structural ar- 
chitectures of the vasohibins and MATCAP 
raise the intriguing possibility that certain 
microtubule subpopulations could be differ- 
entially targeted by each detyrosinase. In mice, 
detyrosination by these enzymes was indis- 
pensable for correct brain development, which 
implies that defects in this posttranslational 
tubulin modification could contribute to cog- 
nitive abnormalities. 


The list of author affiliations is available in the full article online. 
MICROTUBULES


Posttranslational modification of microtubules 
by the MATCAP detyrosinase 


Lisa Landskron, Jitske Bak, Athanassios Adamopoulos, Konstantina Kaplani, Maria Moraiti,
Lisa G. van den Hengel, Ji-Ying Song, Onno B. Bleijerveld, Joppe Nieuwenhuis,
Tatjana Heidebrecht, Linda Henneman, Marie-Jo Moutin, Marin Barisic,
Stavros Taraviras, Anastassis Perrakis*, Thijn R. Brummelkamp*


The detyrosination-tyrosination cycle involves the removal and religation of the C-terminal tyrosine
of α-tubulin and is implicated in cognitive, cardiac, and mitotic defects. The vasohibin-small vasohibin-binding
protein (SVBP) complex underlies much, but not all, detyrosination. We used haploid genetic
screens to identify an unannotated protein, microtubule-associated tyrosine carboxypeptidase
(MATCAP), as a remaining detyrosinating enzyme. X-ray crystallography and cryo-electron microscopy
structures established MATCAP’s cleaving mechanism, substrate specificity, and microtubule
recognition. Paradoxically, whereas abrogation of tyrosine religation is lethal in mice, codeletion
of MATCAP and SVBP is not. Although viable, defective detyrosination caused microcephaly, associated
with proliferative defects during neurogenesis, and abnormal behavior. Thus, MATCAP is
a missing component of the detyrosination-tyrosination cycle, revealing the importance of this
modification in brain formation.


Microtubules are composed of polymerized α- and β-tubulin heterodimers and are core components of the cytoskeleton that are essential for cargo trafficking, cellular organization, and force generation. Microtubule heterogeneity is created by multiple α/β tubulin genes (isotypes) in combination with an array of posttranslational modifications that particularly affect the disordered C-terminal tail of α-tubulin (1, 2). Most α-tubulin isotypes are translated with a C-terminal tyrosine, which can be enzymatically removed and religated (3, 4). This dynamic modification cycle has been implicated in cell division (5-8), neurogenesis (9-11), and heart failure (12-15). Detyrosinated tubulin also serves as the sole precursor for the irreversible generation of Δ2-tubulin, which when stimulated through chemotherapeutic agents leads to peripheral neuropathy (16). The removal of tyrosine can be catalyzed by the vasohibins (two paralogs, VASH1 and VASH2) in complex with the essential cofactor small vasohibin-binding protein (SVBP) (10, 17) and reversed by the tubulin tyrosine ligase (TTL) (Fig. 1A) (18). Loss of VASH-SVBP in cell lines (Fig. 1, B and C) (77) or mice (77) only partially impairs tubulin detyrosination, indicating the existence of yet-to-be-discovered enzyme(s), which complicates functional studies on the role of detyrosination.


The tubulin detyrosination-tyrosination cycle 
is regulated by MATCAP 


To identify remaining tubulin carboxypeptidases, we carried out genome-wide mutagenesis screens in human haploid HAP1 cells lacking both vasohibins (Fig. 1D). For the first genetic screen, we treated the cells with the microtubule-stabilizing agent paclitaxel (which increases detyrosination), and we searched for factors that upon gene-trap inactivation reduced tubulin detyrosination (Fig. 1E). In a second genetic screen, we removed TTL using CRISPR/Cas9 in the vasohibin-deficient cells (to make detyrosination irreversible) and used the amount of tyrosinated α-tubulin as readout (Fig. 1F). Combined analysis revealed the unstudied gene KIAA0895L as the strongest regulator that diminished detyrosination upon gene-trap inactivation in the first screen and stimulated tubulin tyrosination upon inactivation in the second screen (Fig. 1G and fig. S1, A and B). In addition, KIAA0895L was distinguished from most other hits by having no significant effect on 22 phenotypes unrelated to tubulin biology that were previously examined by use of haploid genetic screens (fig. S1C). Because KIAA0895L also scored in ΔTTL cells (Fig. 1F), it likely participates in tyrosine removal rather than religation. Thus, KIAA0895L affects tubulin detyrosination, and for reasons described below, we renamed it MATCAP (microtubule-associated tyrosine carboxypeptidase).

As expected, tubulin detyrosination is only moderately decreased upon the loss of both vasohibins (Fig. 1H). Combined ablation of MATCAP and vasohibins resulted in undetectable levels of detyrosination in different cell lines, even in the presence of paclitaxel (Fig. 1H). Furthermore, overexpression of MATCAP in HeLa cells, which normally have low levels of tubulin detyrosination, efficiently stimulated tubulin detyrosination (Fig. 1I). Moreover, expression of MATCAP reestablished tubulin detyrosination in ΔVASH1 ΔVASH2 ΔMATCAP cells (Fig. 1J). Next, we asked whether the C termini of different α-tubulin isotypes are differentially affected by MATCAP. For that, we fused different tubulin tails to the C terminus of green fluorescent protein (GFP). Coexpression with MATCAP removed the terminal residue of all GFP-fused α-tubulin tails, including those corresponding to tubulin alpha-8 chain (TUBA8) (which possesses a phenylalanine as terminal amino acid) and to TUBA4A (which is encoded without C-terminal tyrosine but can be retyrosinated by TTL) (fig. S2A). Thus, MATCAP acts on the C terminus of α-tubulin and affects detyrosination independently of the known tyrosination and detyrosination enzymes.


MATCAP is a metalloprotease with an atypical 
zinc-binding motif 

Because MATCAP was not annotated as a 
protease, we expressed and purified both full- 
length MATCAP and MATCAP without its 
disordered N-terminal region (MATCAP!?"*7) 
(Fig. 2A). Next, we determined a crystal struc- 
ture of MATCAP??”*” using experimental 
phasing (Fig. 2B and table S1). The structure 
reveals that MATCAP belongs to the gluzincin 
metallopeptidase clan (19), which is characterized
by a zinc ion (Zn²⁺). Consistent with
this, EDTA abolished enzymatic activity (Fig.
2C). The metal ion is coordinated by a HExxH
motif incorporated in an α-helix and resembles
the active site and the fold of the severe acute
respiratory syndrome coronavirus 2 (SARS-CoV-2)
spike-protein receptor angiotensin-converting
enzyme 2 (ACE2) and the anthrax
lethal factor (Fig. 2D). The HExxH motif is
degenerate in MATCAP (HExxxH), which
has also been observed in other rare metallo- 
exopeptidases (20, 21). The extra residue de- 
stroys the canonical N;-O;,4 hydrogen bonding 
helical pattern (formally breaking the α6 helix
in two) but forms an active site similar to that 
of its structural homologs, positioning the 
Fig. 1. Haploid genetics identify MATCAP as a regulator of α-tubulin detyrosination. (A) Illustration of the tubulin detyrosination cycle. Vasohibins and a yet-unknown enzyme remove the C-terminal residue (tyrosine) from α-tubulin. (B and C) U2OS cells proficient and deficient for VASH1 and VASH2, with and without paclitaxel treatment, were subjected to (B) immunoblot analysis (short and long exposures are indicated) and (C) immunostaining for detyrosinated α-tubulin. Scale bar, 20 μm. (D) Schematic representation of a haploid genetic screen. (E and F) Haploid genetic screens for (E) detyrosinated α-tubulin in HAP1 ΔVASH1 ΔVASH2-deleted cells and (F) tyrosinated α-tubulin in HAP1 ΔVASH1 ΔVASH2 ΔTTL-deleted cells were performed. (The schemes were created with BioRender.com.) The antibody symbol indicates which α-tubulin state was detected as readout for the genetic screen. In the fishtail plot, genes enriched for gene-trap mutations in the population with “high” or “low” antibody staining are colored with orange or blue, respectively [for both, two-sided Fisher's exact test, false discovery rate (FDR) corrected P < 0.05]. (G) For each gene, the difference in mutation index (MI) between the two screens is depicted as delta MI, which is calculated as MI(deY-tubulin screen) − MI(Y-tubulin screen). (H) Lysates of indicated mutant cell lines analyzed for detyrosinated α-tubulin by immunoblotting. (I) Lysates of HeLa H1 cells upon ectopic expression of hemagglutinin (HA)-tagged and untagged MATCAP were subjected to immunoblot analysis. (J) Immunoblot analysis of lysates from triple-mutant HAP1 cells (ΔVASH1 ΔVASH2 ΔMATCAP) expressing HA-tagged MATCAP or GFP as control.
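
The delta MI comparison in (G) amounts to a per-gene subtraction between the two screens. The following is a minimal sketch in Python, assuming each screen has already been reduced to one mutation index per gene. The function name, the placeholder gene names (GENE_X, GENE_Y, GENE_Z), all numeric values, and the sign convention are hypothetical; how the mutation index itself is computed is not described in this legend.

```python
def delta_mi(mi_dey_screen, mi_y_screen):
    """Per-gene delta MI = MI(deY-tubulin screen) - MI(Y-tubulin screen).

    Only genes scored in both screens are returned.
    """
    shared = mi_dey_screen.keys() & mi_y_screen.keys()
    return {gene: mi_dey_screen[gene] - mi_y_screen[gene] for gene in shared}


# Hypothetical mutation indices, for illustration only.
mi_dey = {"KIAA0895L": -3.0, "GENE_X": 0.5, "GENE_Y": 0.25}
mi_y = {"KIAA0895L": 2.5, "GENE_X": 0.25, "GENE_Z": 1.0}

scores = delta_mi(mi_dey, mi_y)
# Rank genes by the magnitude of the difference between the two screens.
ranked = sorted(scores, key=lambda gene: abs(scores[gene]), reverse=True)
```

A gene that scores in opposite directions in the two screens ends up with a large absolute delta MI, which is the property the combined analysis relies on.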
Fig. 2. The metallo-carboxypeptidase MATCAP detyrosinates α-tubulin. (A) Schematic representation of MATCAP and its conservation across vertebrates. Amino acid conservation of MATCAP across 39 vertebrate orthologs is shown as analysis of multiply aligned sequences (AMAS) conservation score (calculated in Jalview). The probability of disordered regions as predicted with PrDOS is shown at bottom. (B) Ribbon representation of MATCAP (blue) with the overlaid surface (gray). (C) Recombinant MATCAP protein was incubated with tubulin heterodimers in the presence or absence of EDTA. Tubulin modifications were analyzed by means of immunoblotting. (D) Comparison of MATCAP and two of the related members of the gluzincin clan, highlighting the zinc ion (yellow) and the binding residues. Arrows indicate the two separate α-helices. (E) Close-up of the MATCAP active site pocket. (F) Quantification of immunoblots of lysates from HeLa cells expressing single-point mutants of the active site residues. The detyrosination activity (detyrosinated α-tubulin normalized to α-tubulin) was quantified in three independent experiments (top). Data are shown as mean ± SD. An example immunoblot is shown (bottom).
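
The quantification in (F) can be sketched as a ratio followed by a summary over replicates. This is only an illustration in Python: the band intensities are hypothetical, the function names are invented for the example, and the use of the sample standard deviation (n − 1 denominator) is an assumption, since the legend does not say which SD estimator was used.

```python
from statistics import mean, stdev


def normalized_activity(dety_signal, total_signal):
    """Detyrosinated alpha-tubulin signal divided by total alpha-tubulin signal."""
    if total_signal <= 0:
        raise ValueError("total alpha-tubulin signal must be positive")
    return dety_signal / total_signal


def summarize(replicates):
    """Return (mean, sample SD) of the normalized activity across replicates."""
    ratios = [normalized_activity(dety, total) for dety, total in replicates]
    return mean(ratios), stdev(ratios)


# Hypothetical band intensities (detyrosinated, total) from three experiments.
replicates = [(50.0, 100.0), (60.0, 100.0), (70.0, 100.0)]
avg, sd = summarize(replicates)
```

Normalizing each lane to its own α-tubulin signal before averaging keeps loading differences between experiments from inflating the spread.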


presumed nucleophile Glu²⁸¹ for catalysis. Because the original crystallization conditions contained EDTA, zinc ions were not visible. We thus determined a structure of MATCAP in the presence of Zn²⁺ and refined both the apo and the Zn²⁺-bound structures to 2.1 and 2.5 Å, respectively. The zinc ion is coordinated by the two histidines of the HExxxH motif and a glutamate (Fig. 2E). The structure of the zinc binding site is well maintained in the apo structure, with the exception of the zinc-coordinating glutamate, which is further away from the zinc ion. As expected, MATCAP activity was metal dependent in vitro (Fig. 2C), and mutating the Glu²⁸¹ nucleophile to glutamine (E281Q) or the zinc-binding residues to alanine abolished activity in cells (Fig. 2F) and in vitro (fig. S2, B and C). The structure predicted with AlphaFold (fig. S2D) (22) is similar to both experimental structures. Thus, MATCAP is a peptidase that relies on an atypical catalytic domain for its optimal enzymatic efficiency.
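
The difference between the canonical HExxH signature and the degenerate HExxxH variant described above is easy to see with a simple pattern scan. The sketch below, in Python, is an illustration only: the function name and the toy sequence are hypothetical (the sequence is not the MATCAP sequence), and real protease annotation relies on far richer models than a five- or six-residue pattern.

```python
import re

# Lookahead patterns so that overlapping candidate motifs are all reported.
CANONICAL = re.compile(r"(?=(HE[A-Z]{2}H))")   # HExxH
DEGENERATE = re.compile(r"(?=(HE[A-Z]{3}H))")  # HExxxH, one extra residue


def find_motifs(sequence):
    """Return (label, 1-based start position, matched residues) for each hit."""
    seq = sequence.upper()
    hits = []
    for label, pattern in (("HExxH", CANONICAL), ("HExxxH", DEGENERATE)):
        for match in pattern.finditer(seq):
            hits.append((label, match.start() + 1, match.group(1)))
    return sorted(hits, key=lambda hit: hit[1])


# Hypothetical toy sequence containing one motif of each kind.
toy_sequence = "MKTHEAAHGGSLHEAVLHPQ"
motif_hits = find_motifs(toy_sequence)
```

A scan restricted to the canonical pattern would report nothing for a protein that carries only the six-residue variant, which is in line with the observation that the protein had escaped annotation as a protease.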


Substrate recognition by MATCAP and 
VASH1-SVBP 


To elucidate how MATCAP recognizes its substrate, the negatively charged glutamate-rich C-terminal α-tubulin tail, we first visualized the electrostatic potential of MATCAP’s protein surface (Fig. 3A). Mutation of residues in the positively charged patch close to the catalytic site decreased or abolished the activity of MATCAP (Fig. 3B). Further experimental mutant analysis combined with in silico docking by using HADDOCK (23) yielded two plausible models of the α-tubulin tail bound to MATCAP (Fig. 3C and fig. S3, A and B). Further mutagenesis experiments were equally supportive for both docking models (fig. S3, C to F). The C-terminal part of the tail is firmly positioned in a pocket for catalysis by two lysine (389 and 417) and four arginine (334, 373, 376, and 393) residues in both models. According to these models, the tubulin tail could enter the pocket through two routes. In contrast with the MATCAP tail-binding pocket, VASH1-SVBP
Fig. 3. The substrate binding mode of MATCAP. (A) Electrostatic surface potential of MATCAP. Scale bar, -10 to +10 q. Locations of positively charged residues selected for mutagenesis are indicated. (B) Quantification of the activity of indicated MATCAP mutants in HeLa cells. Data are shown as mean ± SD; n = 3 independent experiments. (C) Docking models of the α-tubulin tail (pink and magenta ribbon with side chains) binding to MATCAP (gray) from two different views. (D) MATCAP binds α-tubulin in a pocket, while VASH-SVBP contains an open groove, both indicated with red arrows. (E) C-terminal α-tubulin peptides were incubated with either recombinant VASH1-SVBP or MATCAP and analyzed by means of nano-liquid chromatography-mass spectrometry (nanoLC-MS)/MS. Peptide counts were normalized to the wild-type peptide. (F) GFP fused to α-tubulin tails with varying terminal amino acids were coexpressed together with the indicated detyrosinating enzymes in HeLa cells.


binds its substrate in an open groove (Fig. 3D and fig. S4A) (5, 24-28). Furthermore, VASH1-SVBP engages with the second-last glutamate of the α-tubulin tail, whereas MATCAP forms an extensive network of interactions with many glutamates, and in particular with the last one, positioned before the scissile bond (fig. S4, B and C). Consequently, the two enzymes have different dependencies on the amino acids of the tubulin tail: In contrast to VASH1, MATCAP’s activity is affected by most amino acid alterations along the tubulin tail except the terminal residue, in vitro (Fig. 3E and fig. S4, D and E) and in cells (Fig. 3F). Thus, MATCAP removes the terminal residue independently of its amino acid identity and strongly relies on the penultimate residue to be a glutamate for proper substrate cleavage. By contrast, VASH1 largely tolerates amino acid changes in the α-tubulin tail as long as the terminal residue is aromatic and the second-last glutamate is intact (fig. S4, D to F). Thus, the distinct structural architectures of MATCAP and vasohibins represent two convergent solutions for tubulin detyrosination, with different substrate recognition and catalytic modes.
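
The two substrate preferences summarized above can be written down as a pair of toy rules. The following Python sketch is a deliberately coarse reading of the text and not a model of the enzymes: the function names and example tails are hypothetical, the terminal residues tolerated by the vasohibin-like rule are taken to be tyrosine and phenylalanine, the "second-last glutamate" is assumed to be the residue two positions before the terminal one (the first E of an EEY ending), and MATCAP's sensitivity to residues further upstream is ignored.

```python
# Terminal residues accepted by the vasohibin-like rule (assumption: Y and F only).
AROMATIC_TOLERATED = {"Y", "F"}


def vash1_like_cleaves(tail):
    """Toy rule: aromatic terminal residue and an intact second-last glutamate."""
    return len(tail) >= 3 and tail[-1] in AROMATIC_TOLERATED and tail[-3] == "E"


def matcap_like_cleaves(tail):
    """Toy rule: any terminal residue, but the penultimate residue must be glutamate."""
    return len(tail) >= 2 and tail[-2] == "E"


# Hypothetical tail endings: wild-type-like EEY and single-residue variants.
example_tails = ["GEEY", "GEEF", "GEEA", "GEAY", "GAEY"]
predictions = {
    tail: (vash1_like_cleaves(tail), matcap_like_cleaves(tail))
    for tail in example_tails
}
```

Under these assumptions a tail ending in a non-aromatic residue is still a substrate for the MATCAP-like rule but not for the vasohibin-like one, whereas altering the penultimate glutamate has the opposite effect.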


A cryo-electron microscopy structure of 
MATCAP on microtubules reveals multiple 
binding interfaces 


Consistent with the reported preferential activity of brain detyrosinases on microtubules (29-31), recombinant MATCAP was more efficient in processing α-tubulin tails on microtubules than on free tubulin dimers (Fig. 4A). Recombinant MATCAP copelleted with microtubules in an ultracentrifugation sedimentation assay in vitro (Fig. 4B). Consistent with this, MATCAP preferentially localized to microtubules in cells, especially when the catalytic nucleophile was mutated (E281Q) (Fig. 4C).

To understand how MATCAP engages with microtubules, we prepared fully tyrosinated microtubules from triple-mutant (ΔVASH1 ΔVASH2 ΔMATCAP) cells and incubated these with recombinant catalytically inactive (E281Q) MATCAP to form a complex on cryo-electron microscopy (cryo-EM) grids. Electron micrographs showed MATCAP-decorated microtubules, further confirmed in two-dimensional (2D) and 3D class averages (Fig. 4, D and E, and fig. S5, A and B). The cryo-EM reconstruction based on particles extracted from microtubules with 14 protofilaments has a nominal resolution of 2.9 Å, with the binding interface being resolved to ~4.5 Å (Fig. 4F and fig. S6). The cryo-EM density allows a clear and distinct placement of atomic models of the tubulin dimer and of MATCAP (Fig. 4G and fig. S5, C to E), and shows low-resolution density that can be attributed to the α-tubulin tail (Fig. 4F, inset), supporting only one of the computational
Fig. 4. Binding of MATCAP to microtubules. (A) In vitro incubation of wild-type purified MATCAP with HeLa tubulin heterodimers or microtubules for the indicated times. n = 5 independent experiments. Data are shown as mean ± SEM, and the 95% confidence interval is shown as blue and gray shades. The levels of deY-tubulin were normalized to α-tubulin levels. (B) Cosedimentation assay of full-length MATCAP and paclitaxel-stabilized microtubules by using a glycerol cushion. Pellet (sedimented microtubules) and supernatant (nonsedimented proteins) were subjected to immunoblot analysis. (C) Immunostaining of U2OS cells transfected with HA-tagged MATCAP constructs. Scale bar, 20 μm. (D) Representative cut-outs of electron micrographs showing microtubules (fig. S5, A and B) and 2D classes of data, with and without MATCAP. Arrows indicate bound MATCAP. (E) Cryo-EM density of a microtubule (gray) decorated with MATCAP (blue). (F) Cryo-EM map of MATCAP (blue) bound to the α-β-tubulin heterodimer (gray). Density corresponding to the C-terminal end of α-tubulin is in salmon pink; MATCAP density interacting with tubulin is in cyan. The displayed map is shown at isocontour 0.5 and 0.05 for the density belonging to tubulin and MATCAP, respectively, to better visualize the map features at each region. (Inset) Highlight of density that is attributed to the α-tubulin tail. (G) Pseudo-atomic model from rigid-body fitting of the MATCAP crystal structure and tubulin (PDB: 7SJ7). The docking model of the α-tubulin tail (ribbon with the C-terminal tyrosine as ball and stick) is highlighted in the inset and connected to the last residue of the crystal structure with a dashed line. Five interfaces are indicated: interfaces observed from the structure (pink) and contact sites suggested by mutations (gray). (H) Immunofluorescent staining of U2OS cells expressing the indicated MATCAP mutants. Scale bar, 10 μm. (I) Quantification of the


models for the α-tubulin tail entering the catalytic MATCAP pocket (Fig. 3C).

The cryo-EM structure shows that the binding interface between α-tubulin and MATCAP has three principal constituents: [Fig. 4G, (1)] the α-tubulin tail binding pocket; [Fig. 4G, (2)] an interface between the last helix of α-tubulin and an α-helix of MATCAP; and [Fig. 4G, (3)] a loop that contacts an adjacent tubulin dimer along the microtubule protofilament. The structure also points out two “regions of interest”: [Fig. 4G, (4)] a positively charged patch at the “backside” of MATCAP, covered by density which could possibly be attributed to binding of the negatively charged β-tubulin tail; and [Fig. 4G, (5)] the positively charged (pI 11.3), proline-rich N terminus of MATCAP, for which there is no clear experimental density.

To study the interfaces, we designed mutations in each of them and monitored their effect on microtubule localization of MATCAP in cells. All five interfaces individually contributed to microtubule localization (fig. S7). Mutations in the helix-helix interface [Fig. 4G, (2)] combined with removal of the N-terminal region [Fig. 4G, (5)] decreased the binding of MATCAP to microtubules (Fig. 4, H and I). This effect was stronger than the effect of the separate mutants (Fig. 4, G and H, and fig. S7C). This combinatorial effect further demonstrates that multiple binding interfaces are involved in recruitment of MATCAP to the polymer. Because the N-terminal disordered region contributes to microtubule binding (fig. S7L), but its positioning in the context of protofilaments is unclear, it could influence MATCAP’s enzymatic activity on microtubules. To study this, we incubated ΔN-MATCAP with either tubulin dimers or assembled microtubules. In this assay, ΔN-MATCAP displayed a greater enhancement of activity on microtubules over tubulin dimers (fig. S7M) than wild-type MATCAP (Fig. 4A). Thus, whereas the disordered region of MATCAP enhances its anchoring on microtubules, it dampens processivity on the polymerized substrate.

Multiple specific interactions between MATCAP and microtubules ensure efficient binding, particularly to the microtubule polymer, and regulate detyrosination activity. Whereas MATCAP recognizes adjacent tubulin dimers along the same protofilament with the interdimeric loop interface, vasohibins recognize tubulin dimers in juxtaposed protofilaments (Fig. 4J) (32), further illustrating the differences between these two tyrosine carboxypeptidases.

the protofilaments.


Combined loss of MATCAP and SVBP perturbs 
detyrosination in vivo 


To address the physiological role of detyrosination, we disrupted MATCAP in mice. Homozygous mice were viable and fertile and displayed no gross alterations. We subsequently crossed these mice with a mouse strain deficient for SVBP, which is required for vasohibin’s enzymatic activity. The resulting double-mutant mice were viable, born at a Mendelian ratio, and showed body weights comparable with those of wild-type mice (fig. S8, A and B). Immunohistochemical analysis of brain tissue indicated a strong decrease of detyrosinated tubulin (Fig. 5A). Similarly, immunoblot analysis on brain tissue showed a reduction of detyrosinated tubulin and a concomitant increase in tyrosinated tubulin (Fig. 5B). Whereas Δ2-tubulin was undetectable, a faint signal was detected with detyrosination-specific antibodies, which could be unspecific background signal, detection of TUBA4A (the tubulin isotype that is translated without C-terminal tyrosine), or remaining detyrosination. Quantitative proteomics on brain tissue from double-mutant mice detected exclusively tyrosinated but not detyrosinated TUBA1A/B (Fig. 5C). This finding was in agreement with our observations in human cell lines (Fig. 1H) and in line with the undetectable levels of Δ2-tubulin in mice, whose production relies on detyrosinated tubulin as precursor. Thus, combined inactivation of SVBP and MATCAP enables the generation of viable adult mice with impaired tubulin detyrosination.


Impaired detyrosination affects brain volume 


Mice in which SVBP was removed (17), albeit 
having substantial amounts of detyrosinated 
tubulin, have smaller brains that resemble 
human patients with cognitive impairments 
and microcephaly. Mice that lack MATCAP 
showed a moderate but significant reduction 
of total brain volume, cerebellum, and olfac- 
tory bulb (Fig. 5, D and E) as quantified by use 
of magnetic resonance imaging (MRI) brain 
scans. The combined removal of MATCAP and 
SVBP led to a more significant decrease in 
brain size compared with that of single mu- 
tants (Fig. 5E) and a reduction in total brain 
weight and total cell number compared with 
those of wild-type mice (Fig. 5, F and G). To 
colocalization of MATCAP mutants from immunofluorescent stainings. Data
are shown as mean. Significance was calculated as one-way analysis of
variance (ANOVA) and Tukey's multiple comparisons test. ****P < 0.0001,
***P < 0.001. (J) Composite molecular model of MATCAP and VASH1-SVBP
(PDB: 6WSL) bound to a microtubule. Arrows indicate interactions with


address the origin of the small brain phenotype, 
we first measured apoptotic cells. Only very 
infrequent and similar amounts of apoptotic 
cells were detected in adult or embryonic 
brains of either genotype (fig. S8, C to E). Next, 
we examined cell proliferation during neu- 
rogenesis in embryos at embryonic day 14.5 
(E14.5). Immunostainings showed a significant decrease in mitotic phospho-histone H3-positive cells (Fig. 5, H and I). This decrease was observed in the ventricular and subventricular zones, the proliferative zones of the embryonic brain (Fig. 5J), which suggests a defect in proliferation of the progenitor cells. To examine whether this brain phenotype led to behavioral phenotypes, we examined nest building, a behavior that is sensitive
to brain lesions in mice. When housed with 
nesting material overnight, double-mutant 
mice completely failed to construct nests, 
whereas MATCAP or SVBP single-mutant 
mice were largely able to build nests (Fig. 5K). 
Thus, these distinct detyrosinating enzymes 
have important and overlapping roles in brain 
development and behavior. 


Conclusion 


Whereas most posttranslational modifications of the α-tubulin tail are generated by large families of related enzymes (2), VASH-SVBP and MATCAP constitute isofunctional yet evolutionarily unrelated enzymes. Their identification now enables functional studies of detyrosination, which is further facilitated by the survival of detyrosination-impaired mice. Because detyrosination precedes retyrosination in this biochemical cascade, their viability is in unexpected contrast with the postnatal lethality of TTL-deficient mice (33). As such, the absence of SVBP/MATCAP interferes with the generation of (i) detyrosinated tubulin, (ii) subsequent tubulin modifications preceded by detyrosination (Δ2-tubulin formation), (iii) microtubule heterogeneity through gradients of tyrosination levels, and (iv) the presence of a modification cycle affected by tubulin polymerization. Thus, the lethality of TTL-deficient mice may not be caused by disturbance of these processes but rather by the low levels of tyrosinated tubulin owing to nonreversible detyrosination. Taken together, our findings show similar activities for these two distinct classes of tubulin detyrosinases but at the same time highlight mechanistic differences between them. These differences lay a foundation to understand how distinct enzymatic properties
Fig. 5. Characterization of detyrosination-impaired mice. (A) Immunohistochemical staining of detyrosinated tubulin in adult mouse brains.
DNA (Draq-5, blue), phospho-histone H3 (PH3; proliferation
(B) Immunoblot analysis of adult mouse brain lysates. (C) Microtubules isolated
from adult mouse brains were subjected to nano
microtubules with different C-terminal modificati
of brain size of mice with indicated genotypes. 3
exemplary raw data MRI slices (bottom) are sh
Tukey's multiple comparisons test. (F) Weights of five adult mouse brains per
genotype. (G) Total number of brain cells counted by means of flow cytometry
using the isotropic fractionator method; n = 5
marker, green) and Tuj1 (neuron marker, red) on coronal brain
E14.5 mouse embryos. Scale bar, 20 μm. (I) Quantification of m
in the dorsal cortex at E14.5. Wilcoxon signed-rank test. (J) Qu
axis of the embryonic brain were analyzed. (K) Nest building
tests of mice with the indicated genotypes. Per genotype, n = 10 mice. In (E) to
(G), (I), and (J), significance is depicted as ****P < 0.0001, ***P < 0.001,
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may facilitate a more nuanced impact on the 
pool of heterogeneous microtubules. 


Materials and methods 
Reagents, constructs and treatments 


To enhance detyrosination, cells were incubated
with 2 μM paclitaxel (Santa Cruz
Biotechnology) for 3 hours. MATCAP (UniProt 
identifier Q68EN5-1) was cloned with or with- 
out 2xHA-tag into the pcDNA3.1(-) vector and 
also into the pLEX_305. VASH1, SVBP and
GFP-tubulin tail constructs were generated 
in a previous study (17). Mutant variants of 
MATCAP were derived through site-directed 
mutagenesis. All generated constructs were 
confirmed by Sanger sequencing. A rabbit
anti-MATCAP antibody was generated by 
immunizing rabbits with full-length recombi- 
nant human MATCAP protein and obtained 
sera were affinity purified (David Biotechnologie
GmbH).


Cell culture 


Cell lines used include HAP1 (34), HCT116 
(RRID:CVCL_0291), U2OS (RRID:CVCL_0042),
HEK293T (RRID:CVCL_0063) and HeLa H1
(RRID:CVCL_3334). Cells were cultured in
IMDM (HAP1) and DMEM (HCT116, U2OS,
HEK293T, HeLa H1) supplemented with 10%
fetal bovine serum (FBS), L-glutamine, penicillin
and streptomycin at 37°C in 5% CO₂.
Cell lines were monitored for mycoplasma 
contamination. 


Generation of cell lines 


Cell lines with gene mutations were generated 
either by lentiviral transduction of Cas9/gRNA 
cloned into the pLentiCrispr vector (including 
a blasticidin or puromycin resistance cas- 
sette) or by co-transfection of a vector with a 
blasticidin-resistance cassette and the pX330 
vector (Addgene 42230) including the Cas9/ 
gRNA. gRNA sequences were

TTL (5'-GTTCTCATCGCGTACCACGA-3'),

VASH1 (5'-ACGGCTTCCAGGCATTTGAT-3'),

VASH2 (5'-TTTGATAGGCAAGGACTCTC-3'), and

MATCAP (5'-TGCGCACAATCGACCATATC-3').

After selection with antibiotics, individual 
clones for each cell line were derived. Genome
editing was monitored by amplifying genomic
loci and subsequent Sanger sequencing of PCR
products. Primers used were

VASH1 (5'-GTGGGAGGTGGAAGTGAAAGAAGG-3', 5'-GCCAAGCAGATGACTGCATATGG-3'),

VASH2 (5'-TGCTATGGAGTACTGGCACTAACTCC-3', 5'-GTCCAAGAAGCCAAGCATTGATTTGG-3'), and

MATCAP (5'-GAGTACAGGGCCTGCTGGTCGTTTG-3', 5'-CGCCCACAAGACACCCTGATTTG-3').

Compound heterozygote clones were iden- 
tified with the TIDE web tool (35) and further 
confirmed by subcloning of the PCR amplicons 
and sequencing. TTL deletion was confirmed
by immunoblotting using the rabbit anti-TTL 
(Proteintech, 13618-1-AP, RRID:AB_2256858). 


FACS-based phenotypic screens in HAP1 cells 


Screens were carried out as described previ- 
ously (17, 36). Retrovirus was produced by 
transfection of HEK293T cells with gene-trap 
vector, packaging plasmids Gag-pol, VSVg, and 
pAdv. After 48 hours the viral supernatant was 
harvested and filtered using Amicon filters 
and applied together with protamine sulfate 
(8 μg/ml) to 40 × 10⁶ HAP1 cells. This procedure
was repeated the next day. After expansion,
mutagenized HAP1 cells were dissociated using
Trypsin-EDTA and washed with phosphate- 
buffered saline (PBS). Cells were fixed with ice- 
cold methanol for 10 min on ice, resuspended 
in FACS buffer (PBS with 10% FBS) and single 
cells were ensured by applying cells to a 40 μm
strainer. After incubation with the primary 
antibody [rabbit anti-detyrosinated tubulin 
(5), rat anti-tyrosinated α-tubulin clone YL1/2
(Thermo Fisher Scientific, MA1-80017, RRID: 
AB_2210201)] for 1.5 hours at room tempera- 
ture, cells were washed three times with PBS 
and then incubated with the secondary
antibody (Alexa 488, Invitrogen) and
DAPI for 1 hour at room temperature. After
three washes with FACS buffer, G1 cells (iden- 
tified by DAPI) within the highest or lowest 1% 
of staining signal were isolated using FACS. 
DNA was isolated and sequencing libraries 
were generated as described before (36). Deep 
sequencing reads from each sample (low and 
high) were aligned to HG38 allowing a single 
mismatch and assigned to non-overlapping 
protein-coding gene regions (Refseq). Unique 
disruptive integrations per gene body (intron 
and exon) of a gene were normalized to the 
total number of sense integrations of each 
sorted cell population. In this analysis the 
5'UTR region was excluded in contrast to pre- 
vious analysis (36). Then the mutation index 
(MI) was calculated by comparing normalized 
sense integrations of the high and low cell pop- 
ulation using a two-sided Fisher’s exact test. 
Fishtail plots were visualized using Python. Data
used to plot the uniqueness across published
haploid screens are available on Phenosaurus
(https://phenosaurus.nki.nl). 
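
The per-gene comparison described above can be sketched as follows. This is a minimal illustration rather than the screen's actual pipeline: the mutation index is assumed here to be the odds ratio of integrations in a gene between the high and low populations, the integration counts are hypothetical, and Fisher's exact test is implemented with the standard library only.

```python
from math import comb


def fisher_exact_two_sided(a, b, c, d):
    """Two-sided Fisher's exact test p-value for the 2x2 table [[a, b], [c, d]]."""
    row1, row2, col1 = a + b, c + d, a + c
    total_tables = comb(row1 + row2, col1)

    def prob(x):
        # Hypergeometric probability of x counts in the top-left cell.
        return comb(row1, x) * comb(row2, col1 - x) / total_tables

    p_observed = prob(a)
    lowest, highest = max(0, col1 - row2), min(row1, col1)
    # Sum all tables that are no more likely than the observed one.
    p_value = sum(
        p for p in map(prob, range(lowest, highest + 1))
        if p <= p_observed * (1 + 1e-7)
    )
    return min(1.0, p_value)


def mutation_index(gene_high, total_high, gene_low, total_low):
    """Assumed definition: odds ratio of integrations in one gene, high vs. low."""
    odds_high = gene_high / (total_high - gene_high)
    odds_low = gene_low / (total_low - gene_low)
    return odds_high / odds_low


# Hypothetical counts for one gene: 30 of 1000 sense integrations in the
# "high" population and 5 of 1000 in the "low" population.
mi = mutation_index(30, 1000, 5, 1000)
p = fisher_exact_two_sided(30, 1000 - 30, 5, 1000 - 5)
print(f"MI = {mi:.2f}, p = {p:.2e}")
```

A gene with no integrations in the low population would need a pseudocount before the odds ratio can be computed; that detail is left out of this sketch.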


Immunofluorescence 


Cells were seeded on cover slips and fixed with 
ice-cold methanol for 4 min at -20°C. After 
1 hour in blocking buffer (5% bovine serum
albumin in PBS) at room temperature, cells were
incubated with primary antibodies in blocking 
buffer overnight. After three washes with PBS 
cells were incubated for 1 hour with secondary
antibodies (Alexa, Invitrogen) in blocking
buffer. Cell nuclei were counterstained with
DAPI (1 μg/ml) for 10 min. Cells were mounted
with Aqua-Poly Mount (Fisher Scientific) and
imaged on a confocal microscope (Leica SP5 or
Zeiss LSM 980 with Airyscan). Primary antibodies
used were mouse anti-HA (BioLegend,
#901503, RRID:AB_2565005), rabbit anti-detyrosinated
α-tubulin (by Marin Barisic)
(5), and rabbit anti-α-tubulin (Abcam, ab18251,
RRID:AB_2210057). Images were processed
in Fiji (37) or Zen Blue Software. Colocalization 
was measured as Pearson correlation coeffi- 
cient with the Fiji plugin EzColocalization (38). 
Colocalization was determined for cells within 
the same range of MATCAP intensity. 
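
For orientation, the Pearson correlation coefficient used as the colocalization measure can be written out in a few lines. This sketch is independent of the EzColocalization plugin; the pixel intensities are hypothetical values for a single cell in two channels.

```python
from math import sqrt


def pearson(xs, ys):
    """Pearson correlation coefficient between two equally long intensity lists."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length (>= 2)")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("a constant channel has no defined correlation")
    return cov / sqrt(var_x * var_y)


# Hypothetical pixel intensities of one cell in two channels.
matcap_channel = [10, 40, 80, 120, 200]
tubulin_channel = [12, 35, 90, 110, 210]
r = pearson(matcap_channel, tubulin_channel)
print(f"Pearson r = {r:.3f}")
```

Values near 1 indicate that the two signals rise and fall together across pixels, values near 0 indicate no linear relationship.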


Immunoblotting 


Cells were lysed directly in sample buffer con- 
taining 100 mM DTT. Proteins were separated 
on 4 to 12% Bis-Tris gels and transferred to 
nitrocellulose membranes. After blocking in 
TBS with 5% milk and 0.1% Tween20, mem- 
branes were incubated with primary and 
corresponding HRP-conjugated secondary 
antibodies (Thermo Fisher Scientific). Primary 
antibodies were anti-α-tubulin (Santa Cruz
Biotechnology, sc-32293, RRID:AB_628412),
rabbit anti-MATCAP (this study), rabbit anti-detyrosinated
α-tubulin (Merck Millipore,
AB3201, RRID:AB_177350), rabbit anti-Δ2
α-tubulin (Merck Millipore, AB3203, RRID:AB_177351),
rat anti-tyrosinated α-tubulin clone
YL1/2 (Thermo Fisher Scientific, MA1-80017, 
RRID:AB_2210201), HRP-conjugated mouse 
anti-GAPDH (GeneTex, GTX627408, RRID: 
AB_2888014). 


MATCAP expression and purification 


Sequence analysis based on the Protein-CCD
application (39) was used to identify the cata- 
lytic domain of MATCAP. Synthetic sequences 
of the catalytic domain of MATCAP (137-471) 
and full-length MATCAP (1-471) were used for 
cloning. The wildtype constructs were used to 
introduce the nucleophilic E281Q mutation. 
All proteins were expressed in Spodoptera
frugiperda (Sf9) insect cells. In brief, a pFastBac1
plasmid with a his10-twinstrepH-3C-MATCAP
genetic construct was created through ligation-independent
cloning (LIC) into custom expression
vectors (40). Bacmid virus was produced
with the Invitrogen Bac-to-Bac method and 
expressed in Sf9 insect cells using Insect-Xpress 
medium (LONZA). The cells were cultured 
in a shaker incubator at 28°C at 80 rpm and 
infected with the bacmid virus 8 hours after 
seeding a culture at 1 × 10⁶ cells ml⁻¹. The
insect cells were harvested 72 hours after the 
infection, snap-frozen in liquid nitrogen and 
kept at -20°C until usage. 

The cell pellet was thawed on ice, and lysed 
with 4 min of sonication (10 s on/30 s off) in ice-cold
lysis buffer [40 mM HEPES/HCl, pH 7.5,
500 mM NaCl, 1 mM Tris(2-carboxyethyl)phosphine
(TCEP), 1 tablet of ethylenediaminetetraacetic
acid (EDTA)-free protease inhibitor
(Thermo Fisher Scientific)]. To clear the lysate,
it was centrifuged (30 min, 54,000 g using a JA
25.50 rotor, Beckman Coulter). The supernatant
was incubated with 500 μL pre-equilibrated
Chelating Sepharose Fast Flow beads (GE
Healthcare) to capture the his-tagged MATCAP.
After 1 hour the beads were washed with an 
excess of washing buffer (20 mM HEPES/ 
HCl, pH 7.5, 500 mM NaCl, 1 mM TCEP) and 
MATCAP was eluted using an imidazole step- 
gradient (30-500 mM imidazole). The elution 
fractions that contained MATCAP were supplemented
with 3C protease during a 4-hour
dialysis (20 mM HEPES/HCl, pH 7.5, 150 mM
NaCl, 1 mM TCEP). Finally, cleaved MATCAP
was gel-filtered using an S200 10/300 column
(GE Healthcare) on an FPLC system (Bio-Rad)
into 20 mM HEPES, pH 7.5, 140 mM NaCl
(protein used for cryo-EM contained 100 mM
NaCl), 1 mM TCEP, 1 μM ZnCl₂ to remove remaining
impurities. The purity of the fractions
was assessed with an 8% Coomassie SDS-PAGE
gel (Invitrogen) and the protein was concentrated
with a 0.5 ml 10,000 MW Amicon-Ultra
concentrator (Amicon) to a concentration
above 5 mg ml⁻¹. The concentrated protein
was snap-frozen in liquid nitrogen and kept 
at -80°C until usage. 


Crystallization and data collection 


Purified MATCAP (137-471) was concentrated
to about 15 to 20 mg ml⁻¹ for crystallization.
Initial hits were obtained in a sitting drop 
format using 96-well two-drop MRC plates 
and the commercial screens PACT, JCSG+, 
Classics-I and II, ProComplex, ComPAS, pH- 
Clear and Morpheus. The drops consisted of 
100 nL protein with 100 nL reservoir solution. 
After a period of one week, initial hits were 
identified in the Classics-I screen and further 
optimized using hanging drops in Linbro
plates. Diffracting crystals of MATCAP were 
obtained in 4.3 M NaCl and 0.1 M HEPES/
NaOH pH 7. Crystals appeared as clusters, 
which were further optimized by using the 
additive screen from Hampton. Crystals grew 
to maximum dimensions over a period of 
two weeks and the most promising additive 
agent for growing single crystals was EDTA. 
We were unable to grow single crystals by 
varying the concentration of EDTA, but we 
succeeded in reducing the number of needles 
that grew in each cluster. Crystals were har- 
vested and cryo-protected in a solution con- 
taining 30% ethylene glycol and vitrified in 
liquid nitrogen. 

All data collection and refinement statistics 
are shown in table S1. A native crystal diffracted 
to 2.1 Å resolution at PSI X06DA (PXIII, Villigen,
Switzerland) at the Swiss Light Source (SLS)
synchrotron, using a PILATUS 2M-F detector.
Crystal content analysis indicated four molecules
in the asymmetric unit, with a solvent
content of 52.6%. As there was no structure of
a close homolog of MATCAP and a reliable 
homology model could not be constructed at
the time, the structure had to be determined 
by experimental phasing. We therefore soaked 
crystals in mother liquor containing methyl- 
mercury chloride solution (10 mM) for ~3 min, 
prior to cryo-protection and vitrification. A 
highly-redundant Hg derivative dataset was 
collected, also at PSI X06DA, at a wavelength of 1.00 Å
at the Hg L-III edge. Crystals belong to
the same space group with very similar unit 
cell dimensions as the native. Following struc- 
ture determination, after we confirmed that 
MATCAP is a zinc-dependent metalloprotease, 
we removed EDTA from the crystallization 
conditions and managed to grow reasonably 
sized (but smaller and weaker diffracting) 
crystals of MATCAP (137-471) in the presence 
of Zn²⁺ ions; these crystals diffracted to 2.5 Å
resolution. All data were also collected at PSI
X06DA. All diffraction data were integrated
automatically by XDS (18). Final scaling was
performed in AIMLESS (41) within the CCP4
suite (42). Structures were visualized for figure 
panels with ChimeraX (43). 
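
The crystal content analysis mentioned above is commonly based on the Matthews coefficient. The sketch below shows the standard relations only; the unit-cell volume, number of symmetry operators, and molecular weight in the example are hypothetical placeholders, because those values are not given in this section, and 1.23 is the usual approximation for an average protein density.

```python
def matthews_coefficient(cell_volume_a3, n_sym_ops, n_mol_asu, mw_da):
    """Matthews coefficient Vm (A^3 per Da) for a given crystal packing."""
    return cell_volume_a3 / (n_sym_ops * n_mol_asu * mw_da)


def solvent_fraction(vm):
    """Approximate solvent fraction from Vm, assuming average protein density."""
    return 1.0 - 1.23 / vm


def vm_from_solvent_fraction(fraction):
    """Invert the relation above to recover Vm from a reported solvent content."""
    return 1.23 / (1.0 - fraction)


# Vm implied by the reported solvent content of 52.6%.
vm_reported = vm_from_solvent_fraction(0.526)
print(f"Vm implied by 52.6% solvent: {vm_reported:.2f} A^3/Da")

# Hypothetical cell: 1,000,000 A^3, 4 symmetry operators, 4 molecules per
# asymmetric unit, 25 kDa per molecule.
vm_example = matthews_coefficient(1_000_000, 4, 4, 25_000)
print(f"Example solvent fraction: {solvent_fraction(vm_example):.3f}")
```

In practice the number of molecules per asymmetric unit is varied until the resulting solvent content falls in a plausible range for protein crystals.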


Crystallographic structure determination 
and refinement 


The MATCAP structure was determined using 
the SIRAS phasing method, with the EDTA 
structure as native and the mercury soak as a 
derivative with anomalous scattering. Solvent 
content analysis identified that this crystal 
form had four molecules in the asymmetric 
unit, related by non-crystallographic symmetry. 
The HKL2MAP interface (44) was used to 
determine that the anomalous signal extended 
to about 4.0 Å, and that dataset was used to
run SHELXD (45) using all low-resolution 
reflections. Most solutions from the different 
semi-random trials of the direct methods were 
considered reasonable based on internal soft- 
ware statistics (CC versus CCweak). The best 
solution identified 11 mercury atoms, four 
with occupancy between 0.85 and 1.0. These 
sites were virtually identical to those detected 
by the same software in the context of the 
CRANK2 pipeline (46). Phasing within CRANK2 
resulted in phases with a reasonable overall
figure of merit (FOM) of 0.328. Following den- 
sity modification, the correct hand was deter- 
mined and the overall FOM improved to 0.67. 
Automated model building using PARROT 
(47) within the CRANK2 pipeline, resulted 
in a model containing 1,369 residues in ten 
fragments; 90% of the residues were docked 
in sequence. 

The structure was manually completed and 
refined with a few iterations of REFMAC (48) 
and rebuilding in COOT (49). The molecules 
are arranged by a rotation operator as a dimer, 
and that dimer is related to another dimer by 
pure translation, making up the four mole- 
cules in the asymmetric unit. At that point, we 
identified that MATCAP is a metalloprotease, 
but that dataset clearly did not show zinc
in the active site, owing to the presence of 
EDTA in the crystallization buffer. We thus 
focused our attention on the lower-resolution
dataset that we collected with crystals grown
with Zn²⁺ in the absence of EDTA. We determined
that crystal structure by simply refining
the model built for the apo form against
this new dataset. The electron density map 
(both the 2mFo-DFc and the phased anoma- 
lous difference Fourier synthesis) confirmed 
the presence of a single zinc ion at the active 
site. Remarkably, the zinc-bound structure showed
only minor differences from the apo structure, even
in the vicinity of the active site. Following several
iterations of REFMAC, COOT and
PDB-REDO (50), the structure refined to an
Rfree of 25.5%. It has no Ramachandran plot
outliers, a RamaZ score of -1.50 (51), and a
Molprobity score of 1.11, ranking in the 100th
percentile (52), and has been deposited to the
PDB (PDBID: 7Z5H). The apo structure has
subsequently also been refined to an Rfree of
25.1%, with no Ramachandran plot outliers, a
RamaZ score of -0.91 (51), and a Molprobity
score of 1.17, ranking in the 100th percentile (52),
and has been deposited to the PDB (PDBID:
7Z5G). We note that the AlphaFold model
from the EBI database (22) shows an ordered
structure only for this MATCAP region, with an
rms difference of 0.89 Å from the crystallographic
model.


HADDOCK modeling 


A work-in-progress model of MATCAP was used
to predict the binding site of the C-terminal 
peptide into the active site of MATCAP using 
high ambiguity driven docking (HADDOCK) 
(23, 53). As a substrate we used a peptide 
with sequence GEEEGEEY. Before docking, 
the crystal structure of MATCAP (137-471) 
was truncated digitally to 147-471, as the con- 
formation of residues 137-147 looked like a 
crystallographic artefact and could be block- 
ing the entrance of the catalytic pocket. In our 
initial docking model, we used three types of 
restraints. First, we defined active residues 
of the positive patch directly opposite the
active site in MATCAP (residues 334, 373, 376,
and 389). Second, we defined active residues
in the peptide (EY), as confirmed by mass
spectrometry. Third, analogous to the DPPIII
crystal structure (PDB: 3T6J) (54), we restrained
the distance of the catalytic zinc ion to the
attacked carbonyl to 1.7 to 3.7 Å. Furthermore,
we specified that the C terminus of the
8-meric peptide should be negatively charged. 
HADDOCK grouped 197 structures in three 
clusters (table S2). This initial docking model 
was confirmed by mutation studies of the ac- 
tive residues, and also suggested additional 
residues that could interact with the peptide. 
In a follow-up docking experiment, we used 
the original active residues EY, 334, 373,
376, and 389, and additionally also 179, 312,
249, 393, and 417. Because the scissile bond
was placed very close to the zinc ion in the
first model, we decided to redefine the distance
restraints between the zinc and the scissile bond
to 1.2 to 3.7 Å. All other parameters were kept
the same. This run of HADDOCK resulted
in docking models similar to those of the first run;
models were now grouped in two clusters
(table S3). 


Microtubule isolation 


Microtubules were prepared from an
HCT116 triple-mutant cell line (ΔMATCAP
ΔVASH1 ΔVASH2) or fresh mouse brains.
Cells were harvested with PBS-EDTA using
a cell scraper and resuspended in 1:1 v/v
isolation buffer [100 mM PIPES/KOH pH 6.9,
2 mM EGTA, 1 mM MgCl₂, 1 mM PMSF, complete
ultra-protease inhibitor tablet (Fisher
Scientific)]. Brains were homogenized in
isolation buffer (1 μl/mg) on ice. Samples were
sonicated on ice and cleared by centrifugation
(1 hour, 100,000 × g, 4°C). Paclitaxel (final
concentration 20 μM) and GTP (final concentration
1 mM) were added to the supernatant
and incubated at 37°C for 30 min. Polymerized
microtubules were then pelleted over a
cushion (60% v/v glycerol, 20 μM paclitaxel
in isolation buffer) for 40 min at 100,000 × g
at 37°C. The pellet was resuspended in warm
isolation buffer (fig. S6A). 


Cryo-EM grid preparation and data collection 


Purified full-length, E281Q inactive MATCAP 
was thawed and kept on ice (fig. S6, B and C). 
To increase microtubule binding, MATCAP 
(5.5 mg ml⁻¹) was diluted 1:1 v/v with low-salt
buffer (100 mM PIPES/KOH, pH 6.9, 2 mM
K-EGTA, 1 mM MgCl₂). Then, 3.5 μL of the freshly
prepared HCT116-derived microtubules was
applied to a glow-discharged (30 mA, 60 s,
GloQube, Quorum) 300-mesh Quantifoil holey
carbon copper grid R1.2/1.3 (Quantifoil Micro
Tools GmbH), which was blotted for 2 s at force
10 in a humidified (80°C, 100% humidification)
Vitrobot chamber (ThermoFisher Scientific).
Subsequently, 3.5 μL diluted MATCAP was
applied to the grid and allowed to adhere to 
the microtubules for 10 s. After 6.5 s of blot- 
ting (Whatman, cat no. 47000-100) the grid 
was plunge-frozen in liquid ethane and trans- 
ferred to liquid nitrogen until imaging. Grids 
were screened in JEM-F200 “F2” 200 kV 
(JEOL) with a K2 camera (Gatan). A dataset 
of 8400 micrographs was collected with EPU- 
2.10.0 on a 300 kV Titan Krios1 (ThermoFisher 
Scientific) with a K3 camera (Gatan) in super- 
resolution mode. We used a calibrated pixel
size of 0.53 Å/pixel with a nominal magnification
of 81,000×. A total dose of 50 electrons
per micrograph divided over 50 frames with
a total exposure time of 2.14 s was used. The
defocus ranged from -0.6 to -2.1 μm.


Cryo-EM image processing and modeling


Data analysis and processing was done in 
cryoSPARC (55) in the Oncode and RHPC GPU 
facility at the NKI, unless mentioned other- 
wise. First, micrographs were preprocessed: 
Imported micrographs were corrected for 
gain, patch-motion (56), and CTF (57) with an 
F-crop of 0.5, resulting in a working pixel size 
of 1.06 Å/pixel. CTF-corrected micrographs
were then used for particle picking. First,
five micrographs were used to manually pick 
particles to create picking templates. Using 
these templates, particles could then be re- 
liably picked with optimized parameters using 
the cryoSPARC filament picker job. Particles 
were subsequently extracted with a box size 
of 512 by 512 pixels and F-cropped to 128 by 
128 pixels. The 4x binned particle stack was 
iteratively classified in 2D to remove broken, 
contaminated and clearly undecorated microtubule
particles. To sort the particles into different
microtubule classes, we created initial
microtubule models (11 to 16 protofilaments)
using the helical refinement option with parameters
as listed in table S4. The particles
were then sorted over the different microtubule
types with a heterogeneous refinement job.
The 14-protofilament microtubule
particles showed the best signal-to-noise ratio 
and contained most particles. The particles 
of this class were selected, un-binned to an 
F-crop of 416x416, and used for a subsequent 
helical refinement job, creating a clean 14 pro- 
tofilament microtubule model and allowing 
good alignment of the particles. The helical 
refinement job, however, is unable to align 
particles to the characteristic microtubule seam. 
Consequently, these maps did not differentiate
between α and β monomers, and thus the
density for MATCAP was blurred along the 
protofilament axis, preventing further analysis. 
To overcome this, particles were 14x symmetry 
expanded and used for local refinement with 
a mask spanning a 2 by 2 tubulin lattice (fig. 
S6D). This mask was created using Chimera 
(58) as follows: we first docked the crystal 
structure of a tubulin dimer to two protofila- 
ments; then we placed one MATCAP catalytic 
domain per tubulin dimer in the (blurred) 
density; finally, the molmap command was 
used to create an asymmetric simulated map 
of four MATCAPs bound to four tubulin dimers 
belonging to two adjacent protofilaments. 
After local refinement with this big mask, 
the characteristic loop differentiating α and
β tubulin was clearly visible (fig. S5C). At this
point the crystal structure of MATCAP could 
be fitted in the map reliably, in a unique ori- 
entation. Then, a smaller mask was created, 
containing only one protofilament and the 
bound MATCAP. A final local refinement with 
this smaller mask resulted in the final map. No 
preferential orientation was observed (fig. S6E).
The map has a nominal resolution of 2.9 Å and
a median resolution of 5.3 Å (fig. S6F and table
S5). A model was obtained by rigid-body fitting
(without further refinement) of MATCAP (PDB:
7Z5H) and tubulin (PDB: 7SJ7) into the map. 
The locally refined map and the model were 
deposited to the PDB and EMDB (PDB: 7Z6S, 
EMDB: EMD-14529). To ease visualization, 
the local resolution of the map was estimated 
(fig. S6G-J) (59) and locally filtered by using 
cryoSPARC. The HADDOCK cluster 2 model 
agreed well with the cryo-EM density and was 
therefore connected to the last modeled resi- 
due of tubulin for a more intuitive interpreta- 
tion, using Coot (fig. S7A) (49). Buried surface 
areas between the contacting interfaces were 
calculated by using PDBePISA (60). Maps 
were visualized and figures were created by 
using ChimeraX (43). 
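
The pixel sizes quoted in the data collection and processing steps follow from simple Fourier-cropping arithmetic, sketched below. The reading that the final 416 by 416 box was obtained by Fourier cropping the 512 by 512 extraction box is an assumption made for this illustration, and the function name is hypothetical.

```python
def fourier_crop_pixel_size(pixel_size, box_in, box_out):
    """Pixel size after Fourier cropping a box of box_in pixels to box_out pixels."""
    return pixel_size * box_in / box_out


super_res = 0.53                                      # A/pixel, as collected
working = fourier_crop_pixel_size(super_res, 2, 1)    # F-crop of 0.5
binned = fourier_crop_pixel_size(working, 512, 128)   # 4x binned particle stack
final = fourier_crop_pixel_size(working, 512, 416)    # assumed final box size
nyquist_final = 2 * final                             # finest resolvable spacing

dose_per_frame = 50 / 50                              # electrons per frame
seconds_per_frame = 2.14 / 50                         # exposure per frame

print(f"working {working:.2f}, binned {binned:.2f}, final {final:.3f} A/pixel")
print(f"Nyquist limit at final box: {nyquist_final:.2f} A")
```

Under this assumption the Nyquist limit of the final particle stack stays below the nominal map resolution of 2.9 Å, so the sampling does not limit the reported resolution.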


In vitro peptide assay 


Peptides incubated with MATCAP were analyzed
as described previously (24). Briefly,
500 μM peptides were incubated with 500 nM
MATCAP for 4 hours at 37°C. Reactions were 
frozen until analysis. Samples were passed 
through a Microcon-10 spin filter (Millipore), 
diluted with 2% formic acid and analyzed by 
LC-MS/MS on a Thermo Orbitrap Fusion 
hybrid mass spectrometer (Q-OT-qIT, Thermo 
Scientific) equipped with a Proxeon nLC 1000 
system (Thermo Scientific, Bremen), using a 
30-min gradient. RAW files were processed
using Proteome Discoverer (version 2.5.400, 
Thermo Fisher Scientific). MS2 spectra were 
searched against a custom database containing 
the mutagenized synthetic versions of tubulin- 
like C-terminal peptides, using SequestHT. No
enzyme was specified, no miscleavages were allowed, and
data were filtered with an Xcorr score >1. Determination
of the extent of peptide cleavage
by MATCAP, and the subsequent assessment of
VASH1 substrate specificity, was performed
based on the number of peptide spectrum
matches (PSM) for the intact and cleaved/detyrosinated
version of each peptide.
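
One simple way to turn PSM counts into an extent of cleavage is the fraction of matches assigned to the cleaved form, sketched below. Whether the original analysis used exactly this ratio is an assumption; the PSM counts and the mutant peptide label are hypothetical.

```python
def cleaved_fraction(psm_intact, psm_cleaved):
    """Fraction of PSMs assigned to the cleaved form, or None without any PSMs."""
    total = psm_intact + psm_cleaved
    return None if total == 0 else psm_cleaved / total


# Hypothetical (intact, cleaved) PSM counts per peptide.
psm_counts = {
    "GEEEGEEY": (12, 36),
    "hypothetical_mutant": (40, 0),
}
fractions = {pep: cleaved_fraction(*counts) for pep, counts in psm_counts.items()}
for peptide, fraction in fractions.items():
    print(peptide, fraction)
```

Spectral counting of this kind is semi-quantitative, so it is best suited to comparing substrates against each other rather than to measuring absolute rates.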


Analysis of TUBA1A/B C-termini from mouse 
brain microtubules 


Mouse brain tubulin pellets were heated for 
5 min at 95°C in 1x S-Trap lysis buffer (50 mM
TEAB pH 8.5, 5% SDS), after which 30 μg protein
amounts (as determined by BCA Protein
Assay Kit, Thermo Fisher Scientific) were di- 
gested on S-Trap Micro spin columns accord- 
ing to the manufacturer’s instructions (ProtiFi, 
NY, United States), using 1:15 (enzyme:protein) 
Lys-N (ImmunoPrecise Antibodies Europe BV, 
Utrecht, The Netherlands). Digests were ana- 
lyzed by LC-MS/MS on the system described 
above, using a 210-min gradient. RAW files were 
analyzed by MaxQuant (version 2.0.1.0) (61)
using standard settings. Spectra were searched 
against the reviewed Swissprot M. musculus 
database (2021_04, 17073 sequences) with 
semi-specific LysN as cleavage specificity,
allowing the identification of full-length and 
truncated versions of the tubulin C terminus. 
The amount of C-terminal detyrosination was 
calculated for each sample, based on the LFQ 
abundances of (truncated) terminal peptides. 
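
A sketch of how the share of each C-terminal form could be derived from LFQ abundances is given below. The three form labels and the abundance values are hypothetical, and normalizing to the summed abundance of all terminal peptides is an assumption about how the calculation was set up.

```python
def terminal_composition(lfq_by_form):
    """Share of each C-terminal peptide form in the summed LFQ abundance."""
    total = sum(lfq_by_form.values())
    if total <= 0:
        raise ValueError("no C-terminal peptide signal in this sample")
    return {form: value / total for form, value in lfq_by_form.items()}


# Hypothetical LFQ abundances for one brain sample.
sample_lfq = {"tyrosinated": 6.0e8, "detyrosinated": 3.0e8, "delta2": 1.0e8}
composition = terminal_composition(sample_lfq)
print({form: round(share, 2) for form, share in composition.items()})
```

Because different peptide forms can ionize with different efficiency, such shares are most informative when the same forms are compared across genotypes.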


In vitro detyrosination assays 


The detyrosination activity on tubulin dimers 
was assessed by incubating 50 nM HeLa
α/β-tubulin dimers (Cytoskeleton) with 25 nM
recombinant MATCAP protein in 50 mM 
Tris-HCl, pH 8, 10% glycerol, 1 mM DTT and 
1 mM PMSF for 45 min at 37°C. MATCAP 
was inhibited with 25 μM EDTA. For testing
MATCAP’s activity on microtubules versus
α/β-tubulin dimers, 3 mg/ml HeLa α/β-tubulin
(Cytoskeleton) was polymerized into microtubules
by the addition of 1 mM GTP, 10%
glycerol and stepwise addition of paclitaxel
(0.3 to 3 to 30 μM every 10 min). The reaction
was incubated at 37°C for 60 min. Micro- 
tubules equivalent of 500 nM tubulin dimers 
and 10 nM MATCAP or 2nM MATCAP?*"47 
protein were incubated in 50 mM Tris-HCl, 
pH 8, 10% glycerol, 1 mM DTT and 1 mM PMSF
for the indicated times at 37°C. Detyrosination 
levels were quantified by immunoblot analysis.
To estimate the intensity of each band, the
sum of pixel values in a square block larger
than the band was measured and corrected
by the subtraction of a background value using 
Image Lab software (Bio-Rad). To account for 
differences in the efficacy of blotting between 
experiments, a linear scale was applied be- 
tween equivalent datasets from different ex- 
periments. Scale factors were calculated by 
linear regression to minimize the least squared 
difference between the points of all datasets. 
For each dataset, the detyrosination activity 
was normalized by the measured intensity for 
α-tubulin. A linear model was fit to estimate
the rates of hydrolysis for the two reactions. 
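
The quantification steps above (background subtraction, scaling between experiments, and a linear rate fit) can be sketched with plain least squares. All numbers are hypothetical, the function names are illustrative, and the single multiplicative scale factor is one plausible reading of the scaling procedure.

```python
def background_corrected(block_sum, background_per_pixel, n_pixels):
    """Band intensity: summed pixel values minus the background of the block."""
    return block_sum - background_per_pixel * n_pixels


def scale_factor(reference, other):
    """Least-squares factor s that minimizes sum((s * other - reference)^2)."""
    return sum(o * r for o, r in zip(other, reference)) / sum(o * o for o in other)


def linear_fit(xs, ys):
    """Ordinary least-squares slope and intercept of ys against xs."""
    n = len(xs)
    mean_x, mean_y = sum(xs) / n, sum(ys) / n
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    sxx = sum((x - mean_x) ** 2 for x in xs)
    slope = sxy / sxx
    return slope, mean_y - slope * mean_x


# Hypothetical time course: detyrosination signal normalized to alpha-tubulin.
minutes = [0, 5, 10, 20]
normalized_signal = [0.0, 0.1, 0.2, 0.4]
rate, offset = linear_fit(minutes, normalized_signal)
print(f"rate = {rate:.3f} per min, offset = {offset:.3f}")
```

The slope of the fitted line serves as the rate estimate, so two reactions can be compared through the ratio of their slopes.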


Co-sedimentation assay 


Microtubules were assembled by incubating 
porcine tubulin dimers (Cytoskeleton) with 
GTP (1 mM), glycerol (10%) and paclitaxel 
(30 μM) at 37°C for 30 min. MATCAP protein
(0.5 μM) and in vitro assembled microtubules
(corresponding to 1 μM tubulin dimers) were
incubated at room temperature for 20 min
and then pelleted (100,000 × g, 35°C, 40 min)
over a paclitaxel-glycerol cushion. The super- 
natant and microtubule-containing pellet were 
then analyzed by immunoblotting. 


Use, generation and maintenance of 
mouse strains 


Mouse experiments were approved by the 
Animal Ethics Committee of the Netherlands 
Cancer Institute (AVD301002017840 authorized 
by the national agency Centrale Commissie 
Dierproeven) and implemented in accordance
with institutional, national and European
guidelines for Animal Care and Use. SVBP- 
deleted mice (C57BL/6), as described before (7),
were recovered by in vitro fertilization of FVB
oocytes. MATCAP KO mice were generated on
an FVB background (Janvier, FVB/NRj) using two
gRNAs (5'-CTGCATGTATTTGCGCACGG-3',
5'-AGATGATTGGCTCGGGCATT-3') to create a
1348 bp deletion, which destroys the catalytic
HEIGHT motif. SVBP and MATCAP genomic 
loci were genotyped by PCR amplification 
using the following primers: 

MATCAP: 5'-GCATCATTGAGGCTGTCCTGGAG-3',

5'-CACCATCATCACAGCCTGGGAC-3', and

5'-GCAGCCTGTGAGAAGGATAACCA-3';

SVBP: 5'-GATCCACCTGCCCGGAAA-3',

5'-CAAACCATGGATCCACGAAA-3', and

5'-TTTCTTCCAGCACCCTCTCC-3'.

The MATCAP locus was genotyped with an annealing
temperature of 67°C and an extension time of 45 s,
and the SVBP locus with 50°C and 30 s,
respectively. Amplicons were analyzed on a 1%
agarose gel. For in vivo studies randomization 
and statistical methods to predetermine the 
sample size were not performed. All recorded 
measurements were included. 


In vivo MRI scanning of mouse brains 


Mice were injected intraperitoneally with 
a 0.4 mmol/kg dose of 30 mM manganese chloride
(MnCl₂) 24 hours before scanning them
individually with ParaVision 6.0.1 software in
a 7 Tesla BioSpec 70/20 USR MRI system with
a ø23 mm ¹H transmit-receive mouse brain
volume coil (RF RES 300 1H 023 M.BR QSN 
TR 2.4M), all from Bruker BioSpin. During the 
acquisition, mice were anaesthetized with 
1-2% isoflurane in a mix of air and oxygen. 
Acquisition parameters were: T1-weighted, 3D
gradient echo sequence, TR = 31 ms, TE = 
7.7 ms, flip angle = 37°, field-of-view = 20 by 
15 by 17 mm, matrix size = 200 by 150 by 170, 
number of averages and repetitions = 1, total 
acquisition time = 20 min with respiratory trigger,
isotropic resolution = 100 μm. For quantifications,
DICOM files were loaded into 3D
Slicer (62). Brain structures were manually 
outlined on every third (substructures) or tenth 
(total brain) slice. Then slices in-between were 
automatically filled using the segmentation 
editor tool, the selection was visually inspected 
and if needed manually refined. Measurements 
were made using the quantification tool in 3D 
Slicer. 3D reconstructions are shown as ortho- 
graphic renderings. Measurements were done 
blinded. Statistical tests were performed using 
GraphPad Prism. 
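
The acquisition geometry and dosing quoted above can be cross-checked with a few lines of arithmetic. The helper names, the 25 g body weight, and the voxel count are hypothetical; the matrix size, resolution, dose, and stock concentration are taken from the text.

```python
def field_of_view_mm(matrix, resolution_um):
    """Field of view per axis in mm from matrix size and isotropic resolution."""
    return tuple(n * resolution_um / 1000 for n in matrix)


def volume_mm3(n_voxels, resolution_um):
    """Volume of a segmented structure from its voxel count."""
    return n_voxels * (resolution_um / 1000) ** 3


def injection_volume_ml(body_weight_g, dose_mmol_per_kg=0.4, stock_mm=30):
    """Volume of MnCl2 stock (concentration in mM) needed for a body-weight dose."""
    mmol_needed = dose_mmol_per_kg * body_weight_g / 1000
    return mmol_needed / stock_mm * 1000  # mmol / (mmol per litre) -> ml


fov = field_of_view_mm((200, 150, 170), 100)
print("field of view (mm):", fov)
print("volume of 450,000 voxels (mm^3):", round(volume_mm3(450_000, 100), 1))
print("injection for a 25 g mouse (ml):", round(injection_volume_ml(25), 3))
```

The matrix size and resolution reproduce the stated field of view of 20 by 15 by 17 mm, which is a quick sanity check on the reported parameters.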


Counting of brain cells 


The total number of brain cells per adult mouse 
brain was counted with the isotropic fraction- 
ator method (63). Briefly, intact nuclei were 
isolated from paraformaldehyde (PFA)-fixed 
adult brains by homogenization in a 40 mM
sodium citrate, 1% Triton X-100 solution. 
Washes from rinsing the glass homogenizer 
were collected and centrifuged (15 min at 
4000 × g) together with the homogenate. Pelleted
nuclei were resuspended in PBS with
0.5 μg/ml DAPI. A fraction of the nuclei solution
was mixed with 50 μl CountBright Absolute
Counting Beads (Thermo Fisher) and
technical duplicates for each sample were ana- 
lyzed on a flow cytometer. The flow cytometer 
operator was blinded. Acquired measurements 
of cells and beads were used to calculate the 
cell concentration of each sample according to 
the manufacturer’s manual. 
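
Bead-based absolute counting reduces to a ratio of events scaled by the known number of beads added. The sketch below assumes the usual form of that calculation; the event counts, bead number, and volumes are hypothetical, and the lot-specific bead concentration would come from the product sheet.

```python
def nuclei_per_ul(cell_events, bead_events, beads_added, sample_volume_ul):
    """Nuclei concentration from the ratio of nuclei events to bead events."""
    return (cell_events / bead_events) * (beads_added / sample_volume_ul)


def total_nuclei(concentration_per_ul, suspension_volume_ul):
    """Total nuclei in the whole suspension prepared from one brain."""
    return concentration_per_ul * suspension_volume_ul


# Hypothetical technical duplicates of one sample.
duplicates = [
    nuclei_per_ul(20_000, 5_000, 50_000, 400),
    nuclei_per_ul(19_000, 5_000, 50_000, 400),
]
mean_concentration = sum(duplicates) / len(duplicates)
print("nuclei per ul:", mean_concentration)
print("total nuclei:", total_nuclei(mean_concentration, 10_000))
```

Averaging the technical duplicates before scaling to the full suspension volume keeps pipetting noise from being multiplied into the final count.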


Nest building test 


Nest building behavior was assessed as previ- 
ously described (64). Briefly, adult mice were 
moved to individual cages with one flat enrichment
sheet (7 g) per cage. The next morning,
nests were scored using the following scale: 
(i) nests with wall higher than mouse body 
height, (ii) flat nest, (iii) no clear nest but 
>50% shredded nest material, (iv) no nest but 
>50% shredded nest material, (v) no visible 
engagement with the nesting material at all. 
The experimenter scored the nests blinded. 


Immunohistochemistry 


Sections of formalin-fixed tissues were depar- 
affinized and rinsed three times with wash 
buffer (0.05% Tween20 in PBS). Antigen re- 
trieval was performed for the deY-tubulin 
(Citric buffer from Scytek, CBB999, 20 min, 
98°C) and cleaved caspase 3 (Tris/EDTA pH 9, 
30 min, 96°C) staining. After inactivation of
endogenous peroxidases with 3% H2O2 in methanol,
sections were again rinsed three times and blocked
in 4% bovine serum albumin (BSA) and 5% normal goat
serum in PBS. Primary antibodies were applied
overnight at 4°C. After three washes, sections were
incubated with Envision+ System-HRP labeled
antibodies. After another three washes with
wash buffer, sections were visualized with 
DAB+ (Dako, K3468, 1 ml substrate buffer + 
20 µl Chromogen) for 3 min, rinsed with
demi H2O, counterstained with Haematoxylin,
dehydrated, and mounted with Entellan. Primary
antibodies were rabbit anti-detyrosinated
tubulin (Barisic lab) and cleaved caspase 3 (Cell 
signaling, #9661, RRID:AB_2341188). Apoptotic 
cells were quantified using QuPath (65). 

For mouse brain stainings, embryonic brains 
dissected at E14.5 were fixed for 7 hours with 
4% PFA in PBS. After three PBS washes brains 
were cryopreserved using a 30% sucrose sol- 
ution, and then frozen in 7.5% gelatin plus 
15% sucrose solution. Coronal cryosections 
were obtained at 12 µm thickness and post-fixed
with 4% PFA for 10 min, washed with
1x PBS, then treated with 0.3% Triton X-100 
for 5 min, and incubated in blocking solution 


INTRODUCTION: Since the theoretical proposals
of the first topological insulators (TIs), solid- 
state realizations of TIs and topological semi- 
metals (TSMs) have been discovered at a rapid 
pace. The recently introduced theories of 
topological quantum chemistry and symmetry- 
based indicators (SIs) have in particular fa- 
cilitated the discovery of topological phases of 
matter and large-scale searches for materials 
with experimentally accessible topological 
properties at the Fermi energy (E_F). Away
from E_F, energetically isolated bands and
topological gaps are also useful in numerous 
experimental settings; they can be accessed 
via (electro)chemical doping, electrostatic gat- 
ing, hydrostatic pressure, and nonequilibrium 
photoexcitation and are relevant to Floquet 
engineering and nonlinear optical experiments. 
We have completed a catalog of stable and 
fragile topology in all of the bands and bandgaps
both at and away from E_F in the materials
contained in the Inorganic Crystal Structure 
Database (ICSD). We have made our results
publicly accessible through a substantial upgrade
of the Topological Materials Database
(https://topologicalquantumchemistry.com/). 


RATIONALE: At the time of our study, the ICSD 
contained 193,426 entries, of which 96,196 char- 
acterized stoichiometric chemical compounds 
with processable (noncorrupt) CIF structure 
files. We performed first-principles calculations 
on all 96,196 processable ICSD entries, resulting 
in convergent electronic structures for 73,234 
entries when taking into account the effects of 
spin-orbit coupling. For each convergent elec- 
tronic structure calculation, we computed the 
topological class and stable and fragile SIs of 
all bands above the core shell up to an ener- 
getic cutoff set to a filling of roughly twice the 
number of valence electrons. The 73,234 con- 
vergent electronic structures were grouped 
into 38,298 unique materials by common chemical
formulas and stable topology at E_F.

Topological materials statistics and repeat-topological surface states. (A and B) Percentage of known
3D materials that are topological at intrinsic electronic filling (52.65%) (A) and contain at least one
topological band (87.99%) (B). (C) The (0001)-surface states of Bi2Mg3 obtained from surface Green's
functions. Topological Dirac-cone surface states appear at Γ both slightly above the Fermi level (E_F gap) and
in the next gap below E_F (RTopo gap), revealing the topological origin of the “surface resonance bands”
observed in previous ARPES experiments.

Our calculations represent the completion
of the symmetry-indicated band topology of
known nonmagnetic materials, and a doubling
of the number of materials accessible in pre- 
vious topological material catalogs. We reveal 
the existence of repeat-topological (RTopo) 
materials with stable topological insulating 
(TI) gaps at and just below E_F, and supertopological
(STopo) materials in which every
energetically isolated set of bands above the 
core shell is stable topological. 


RESULTS: Our findings recontextualize several 
previous experimental investigations of topological
materials. We find that the transition-metal
chalcogenides Ta2NiSe5 in space group (SG) 15 (C2/c)
and Ta2NiSe7 in SG 12 (C2/m)—
previously highlighted for hosting exciton- 
insulator and charge density wave phases, 
respectively—are 3D TIs in their normal states. 
Additionally, we find that the higher-order TI 
rhombohedral bismuth in SG 166 (R-3m) and
the indirect-gap TI Bi2Mg3 in SG 164 (P-3m1)
are both RTopo and STopo materials. Previous 
angle-resolved photoemission spectroscopy 
(ARPES) investigations of Bi2Mg3 have demonstrated
the presence of “surface resonance bands” below
E_F; here, we identify these bands as RTopo
Dirac-cone surface states. Our computations reveal
that 52.65% of all studied materials are topological
at E_F, roughly two-thirds of bands across all
materials exhibit
symmetry-indicated stable topology, and 87.99% 
of all materials contain at least one connected 
set of bands with symmetry-indicated stable 
or fragile topology. 


CONCLUSION: Our findings motivate the for- 
mulation of a periodic table of chemical com- 
pounds based on the ubiquity of nontrivial 
electronic band topology in solid-state mate- 
rials. The relative preponderance of RTopo and 
STopo materials in the ICSD, combined with 
our recognition that the surface resonance 
bands in Bi2Mg3 are RTopo surface states,
suggests that many of the surface resonances 
detected in previous ARPES investigations are 
in fact topological surface states protected 
by spectral flow below E_F. Lastly, as there are
known TI and TSM phases that cannot be 
detected by SIs, our results represent a lower 
bound on the percentage of materials in 
nature with topologically nontrivial elec- 
tronic properties; future investigations may 
reveal that an even higher percentage of solid- 
state materials exhibit previously overlooked 
topological properties. 


The list of author affiliations is available in the full article online. 
*Corresponding author. Email: regnault@princeton.edu (N.R.); 
maiagvergniory@dipc.org (M.G.V.); bwieder@mit.edu (B.J.W.) 
†These authors contributed equally to this work.

Cite this article as M. G. Vergniory et al., Science 376, 
eabg9094 (2022). DOI: 10.1126/science.abg9094 


READ THE FULL ARTICLE AT
https://doi.org/10.1126/science.abg9094 




RESEARCH 


RESEARCH ARTICLE 


TOPOLOGICAL MATTER 


All topological bands of all nonmagnetic stoichiometric materials


Maia G. Vergniory*†, Benjamin J. Wieder*†, Luis Elcoro, Stuart S. P. Parkin, Claudia Felser,
B. Andrei Bernevig, Nicolas Regnault*


Topological quantum chemistry and symmetry-based indicators have facilitated large-scale searches 
for materials with topological properties at the Fermi energy (E_F). We report the implementation of a
publicly accessible catalog of stable and fragile topology in all of the bands both at and away from E_F
in the 96,196 processable entries in the Inorganic Crystal Structure Database. Our calculations, which 
represent the completion of the symmetry-indicated band topology of known nonmagnetic materials, 
have enabled the discovery of repeat-topological and supertopological materials, including rhombohedral 
bismuth and Bi2Mg3. We find that 52.65% of all materials are topological at E_F, roughly two-thirds
of bands across all materials exhibit symmetry-indicated stable topology, and 87.99% of all materials
contain at least one stable or fragile topological band. 


The field of solid-state physics has over the past
15 years been advanced in large part by the
discovery of nontrivial electronic band topology in
nonmagnetic crystalline solids (1, 2). Since the
conceptual proposal and theoretical prediction
of the first two-dimensional (2D) (3, 4) and 
3D (5) topological insulators (TIs), research- 
ers have predicted solid-state realizations of 
topological phases of matter at a rapid pace. 
Notable examples include symmetry-protected 
topological semimetals (TSMs) (6-10) and topological
crystalline insulators (TCIs) (11-17).
Owing to negligible electron-electron interac- 
tions, many of the theoretical predictions of 
TI, TCI, and TSM phases in solid-state mate- 
rials were quickly confirmed in spectroscopic 
and transport experiments (18-27). 

Initially, solid-state TIs and TSMs were 
believed to be rare and esoteric phases of 
matter. However, over the past 4 years, the 
theories of topological quantum chemistry 
(TQC) (28, 29) and symmetry-based indica- 
tors (SIs) (16, 17, 30-36) have facilitated high- 
throughput searches for magnetic (37) and 
nonmagnetic (38-40) topological materials, 
revealing solid-state TIs and TSMs to be ubiquitous
in nature. For example, Vergniory et al.
(38) reported high-throughput analyses of the 
symmetry-indicated stable band topology at 
the Fermi level (E_F) in ~26,000 stoichiometric,
inorganic, “high-quality” crystal structures ob- 
tained from the Inorganic Crystal Structure 
Database (ICSD) (41). Of the ~26,000 ICSD 
entries, roughly 7000 were found to be topological
at E_F (38).

This result raises two questions. First, the 
complete ICSD contains 96,196 stoichiomet- 
ric ICSD entries with processable (noncorrupt) 
structural data as defined in text S4 (42), raising 
the question of whether a high percentage of 
all known inorganic materials are topological
at E_F. Second, energetically isolated groupings
of bands may exhibit nontrivial topology,
independent of the electronic filling. Hence, 
the distribution of topological bands away 
from E_F in real materials is also a major
outstanding question in the study of quantum
matter. In this work, we answer both questions 
by performing a complete study of symmetry- 
indicated band topology in all known, stoichio- 
metric, inorganic crystalline solids without 
magnetic order. In contrast to the previous 
high-throughput topological materials searches 
in (38-40), our calculations included a com- 
plete diagnosis of symmetry-indicated band 
topology away from E_F. Using TQC and nonmagnetic
SIs, we computed the symmetry-indicated, nonmagnetic
band topology of all bands in all 96,196
stoichiometric ICSD entries with valid structure
files, at electronic fillings ranging from the core
shell up to at least 2N_e, where N_e is the number of valence
electrons. Unlike in (38-40), the SIs used in 
this work include the SIs of stable band 
topology—which indicate familiar strong and 
weak TI and TCI phases—as well as the recently
introduced SIs of fragile band topology
(43-46)—which represent more exotic TCI
phases with topological corner modes (34) 
and twisted-boundary edge states (47). Fragile 
topological phases have emerged as an area 
of intense interest after recent theoretical 
studies indicated that the superconducting 
and correlated-insulating states in magic-angle 
twisted bilayer graphene may originate from 
nearly flat fragile topological bands (48, 49). 

Through our calculations, we discovered 
previously unrecognized classes of topologi- 
cal materials, including enforced TSMs with 
energetically isolated fragile bands at E_F,
repeat-topological (RTopo) materials with 
stable topological insulating gaps at and just 
below E_F, and supertopological (STopo) materials
in which every energetically isolated set
of bands above the core shell is stable topo- 
logical. We have upgraded the Topological 
Materials Database (50)—a publicly available 
online catalog of topological materials—for 
accessing and intuitively searching the results 
of this study. In (42), we present detailed 
statistics for our computations, offer lists of 
idealized materials in each topological class, 
and highlight the features of the Topological 
Materials Database (50) implemented for 
this work, which include dynamical zoom 
options, density-of-states calculations, electronic- 
structure calculations in the absence of spin- 
orbit coupling (SOC), and advanced search 
options (see text S7 for a complete list of added 
features). We find that 52.65% of all materials
are topological at E_F, and roughly two-thirds
of bands across all materials exhibit
the symmetry-indicated topology of 3D strong 
TIs, weak TIs, TCIs, and higher-order TIs. 
These phases are classified as topologically 
stable, as they are robust to the addition of 
trivial or fragile bands (although sensitive to 
the relaxation of time-reversal and crystal
symmetries) (1-5, 11-17, 29, 30). Most surprisingly,
we find that 87.99% of all materials contain 
at least one stable or fragile topological band 
in their energy spectrum, even if away from 
E_F. Our discovery of ubiquitous electronic band
topology in solid-state materials motivates 
the formulation of a new periodic table of 
chemical compounds in which electronic bands 
in materials are sorted by a combination of 
common topological features and chemical 
and structural properties. 
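
The two definitions introduced above (RTopo and STopo) can be made concrete with a small sketch. It assumes that each energetically isolated band group, and each gap, has already been labeled with one of the class names used in this article, and that "stable topological" means NLC or SEBR; the function names and input layout are hypothetical and are not the authors' implementation.

```python
# Classes treated as symmetry-indicated stable topological in this sketch.
STABLE_CLASSES = {"NLC", "SEBR"}


def is_supertopological(isolated_band_classes):
    """STopo sketch: every isolated band group above the core shell is stable.

    `isolated_band_classes` lists the class of each energetically isolated
    set of bands above the core shell (hypothetical input format).
    """
    return bool(isolated_band_classes) and all(
        c in STABLE_CLASSES for c in isolated_band_classes
    )


def is_repeat_topological(gap_classes_up_to_ef):
    """RTopo sketch: the gap at E_F and the gap just below it are both stable.

    `gap_classes_up_to_ef` lists the cumulative class at each gapped filling
    in order of increasing filling, ending with the gap at intrinsic filling.
    """
    last_two = gap_classes_up_to_ef[-2:]
    return len(last_two) == 2 and all(c in STABLE_CLASSES for c in last_two)


# Hypothetical labels, for illustration only
example_gaps = ["LCEBR", "SEBR", "NLC"]
example_bands = ["SEBR", "NLC", "SEBR"]
```

With these toy labels, `is_repeat_topological(example_gaps)` and `is_supertopological(example_bands)` both evaluate to true, whereas a single trivial (LCEBR) entry in the band list would rule out the STopo label.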

Lastly, our characterization of electronic 
band topology away from E_F is immediately
useful in numerous experimental settings, 
including angle-resolved photoemission spectroscopy
(ARPES) experiments, which measure states at and
below E_F, and pump-probe experiments, in which
electrons can be excited to observe bands above
E_F. As discussed below,
our theoretical calculations provide robust ex- 
planations for previously puzzling ARPES 
data. Beyond photoemission experiments, 
states away from E_F may also be accessed
via (electro)chemical doping, electrostatic
gating, hydrostatic pressure, and nonequilib- 
rium photoexcitation, and are relevant to 
Floquet engineering and nonlinear optical 
experiments (51). Additionally, even in topologically
trivial insulating materials, exotic
interaction effects may be accessed by doping 
or gating the Fermi level into an isolated topo- 
logical band. For example, independent of the 
cumulative band topology at higher energies, 
if a superconducting state is induced from a 
partially occupied, isolated band, then the non- 
trivial stable or fragile topology of the isolated 
band has been shown to provide a lower bound 
on the superfluid weight (52, 53). 


Dataset generation 


In this work, we applied TQC (28, 29) and 
stable and fragile SIs (16, 17, 30-37, 43-46) 
to diagnose the symmetry-indicated topology 
of all isolated bands above the core shell in the 
stoichiometric materials in the ICSD. To ob- 
tain the topological classification of each sepa- 
rated group of bands in the energy spectrum, 
we adopted the methods previously used in 
the high-throughput material searches in 
(37, 38). Uniquely in this work, after computing
the electronic structure using the intrinsic
electronic filling ν = N_e, we varied ν to compute
the topological properties above and below E_F.
Below, we summarize the numerical and topological
details of our calculations (Fig. 1); further specific details are
provided in texts S2 to S4. 

Fig. 1. Workflow schematic for generating the Topological Materials Database. For each entry in the
Inorganic Crystal Structure Database (ICSD) (41), we first determine whether the entry is stoichiometric,
contains coordinates for all of the atoms listed in the chemical formula, and lists atomic coordinates
compatible with the crystallographic space group of the material. We then perform self-consistent DFT
calculations, both with and without incorporating the effects of spin-orbit coupling (SOC). Irrespective of
whether SOC is taken into account, each DFT calculation consists of four steps (labeled VASP1 to VASP4; see
text S4 for details) in which we perform a self-consistent calculation (SCC) of the charge density, obtain
the symmetry data [small corepresentation (corep) multiplicities] at all integer electronic fillings above
the core shell, and compute the electronic band structure (BS) at E_F and the density of states (DOS). Lastly,
using the VASP2Trace program (54) previously implemented for (38) and an updated version of the Check
Topological Mat program on the Bilbao Crystallographic Server (55) implemented for this work, we compute for
each DFT calculation the stable and fragile topological classifications of all isolated groupings of bands
above the core shell as determined by the compatibility relations in topological quantum chemistry (28) (see
texts S2 and S3 for details).

We began by selecting one of the 96,196 processable
entries in the ICSD with a stoichiometric chemical
formula. We next converted the crystal structure
listed in the ICSD into an input file for density
functional theory (DFT) calculations, excluding
cases in which the ICSD structure file was missing
atoms listed in the chemical formula or reported
atomic positions incompatible with the symmetries
of the material space group (SG) (text S4). We then
performed self-consistent, ab initio calculations
of the electronic band structure and density of
states with ν set to charge neutrality (ν = N_e),
filtering out cases in which the calculations did
not converge or converged to a magnetic
(meta)stable state.

Next, as detailed in text S3, we used the
VASP2Trace (54) and Check Topological Mat (55)
programs to determine the unitary symmetry
eigenvalues of each of the Bloch-state multiplets
at each of the maximal (high-symmetry) k vectors.
For each degenerate multiplet of Bloch states, the
combination of extracted symmetry eigenvalues
establishes a correspondence to an irreducible
small corepresentation (corep) of the little group
at k (56). Then, choosing increasing integer values
of the valence electronic filling ν from 0 to at
least 2N_e (or up to the first filling ν > N_e at
which the electronic band structure is not
symmetric and convergent; see text S3A), we
identified fillings at which all of the Bloch
states in each maximal k vector are fully occupied,
such that a gap is present in the energy spectrum
at each maximal k vector (although not necessarily
at the same energy, as states are taken to be
separately filled up to ν at each k point). Lastly,
at ν = N_e, as well as at each ν at which there is
a gap at all maximal k points, we extracted the
symmetry data vector (29, 37, 38), defined as the
multiplicities of the small coreps corresponding to
the filled Bloch-eigenstate multiplets across each
of the maximal k vectors.

Crucially, the symmetry data vector at each ν
facilitates the topological classification of
filled or energetically isolated groupings of
bands. Specifically, in each SG, the elementary
band coreps (EBRs) correspond to the independent,
topologically trivial [i.e., “Wannierizable”
(57)] bands (28, 58). Hence, if a set of bands
does not transform in an integer-valued linear 
combination of EBRs, then the bands exhibit 
nontrivial stable topology. As established in 
(37, 38) and detailed in text S2, a symmetry data 
vector can be classified into one of five possible 
topological classes. First, if the symmetry data 
vector corresponds to a set of Bloch states in 
which a degenerate multiplet at a maximal k 
point is partially filled, then the system is an 
enforced semimetal with Fermi degeneracy 
(ESFD). As an example, 3D HgTe [ICSD 31845, 
SG 216 (F-43m)] is an experimentally established,
ESFD-classified TSM that realizes a
2D TI phase in few-layer quantum-well geom- 
etries (4). Next, if the symmetry data vector 
corresponds to a set of high-symmetry-point 
Bloch eigenstates that are implied by the com- 
patibility relations to connect to other states 
outside of the symmetry data along a high- 
symmetry line or plane, then the system is 
an enforced semimetal (ES). A well-studied 
ES material is the archetypal higher-order-topological
Dirac semimetal Cd3As2 [ICSD 107918, SG 137
(P4_2/nmc)] (6, 19, 59, 60). Conversely, if the
symmetry data vector satisfies
the compatibility relations along all high- 
symmetry lines and planes, then the bands 
that transform in the symmetry data either 
correspond to a stable or fragile TI or TCI, 
correspond to a TSM phase, or are topolog- 
ically trivial. In the case in which the symmetry 
data vector corresponds to a stable topological 
set of bands, the bands described by the sym- 
metry data may be classified into two cate- 
gories. If stable topological bands transform in 
an integer-valued linear combination of pieces 
of a disconnected EBR, then the bands are 
classified as a split EBR (SEBR) (28, 58). Alter- 
natively, if stable topological bands do not 
transform in an integer-valued linear com- 
bination of disconnected EBR pieces, then the 
bands are classified as “not equal to a linear 
combination” (NLC). The archetypal symmetry-indicated
stable topological material is the experimentally
established 3D TI Bi2Se3 [ICSD 617079, SG 166
(R-3m)] (18), which is classified
as SEBR. Lastly, if the symmetry data of an 
isolated grouping of bands does correspond 
to an integer-valued linear combination of 
EBRs (LCEBR), then the symmetry eigenvalues 
of the occupied bands are compatible with 
either fragile or trivial topology [although 
the bands may also exhibit non-symmetry- 
indicated stable topology (34)]. 
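
The filling scan and the five-way classification described in this section can be sketched in a few lines. This is only an illustration under stated assumptions: the multiplet degeneracies are toy values, the boolean inputs stand in for the compatibility-relation and EBR-decomposition checks that the actual workflow delegates to the Check Topological Mat program, and all function and parameter names are hypothetical.

```python
from itertools import accumulate


def gapped_fillings(multiplet_degeneracies, max_filling):
    """Fillings at which no multiplet is partially filled at any maximal k point.

    `multiplet_degeneracies` maps a k-point label to the degeneracies of its
    Bloch-state multiplets in order of increasing energy. A filling is kept
    only if it equals a cumulative degeneracy at every k point, so that a gap
    exists at each k point (not necessarily at the same energy).
    """
    allowed = None
    for degeneracies in multiplet_degeneracies.values():
        closed_shells = set(accumulate(degeneracies))
        allowed = closed_shells if allowed is None else allowed & closed_shells
    return sorted(n for n in (allowed or set()) if 0 < n <= max_filling)


def classify_symmetry_data(partially_filled_multiplet,
                           satisfies_compatibility,
                           is_ebr_combination,
                           is_split_ebr_combination):
    """Map the outcome of the symmetry checks onto the five classes in the text."""
    if partially_filled_multiplet:
        return "ESFD"   # degenerate multiplet partially filled at a maximal k point
    if not satisfies_compatibility:
        return "ES"     # bands must connect along a high-symmetry line or plane
    if is_ebr_combination:
        return "LCEBR"  # trivial or fragile by symmetry eigenvalues
    if is_split_ebr_combination:
        return "SEBR"   # stable: combination of pieces of disconnected EBRs
    return "NLC"        # stable: not a linear combination of EBR pieces


# Toy degeneracies at three hypothetical maximal k points
toy_multiplets = {"GM": [2, 2, 4, 2], "X": [4, 2, 2, 2], "M": [2, 4, 2, 2]}
toy_gaps = gapped_fillings(toy_multiplets, max_filling=10)
```

In this toy case only fillings 8 and 10 close a shell at all three k points, so only those fillings would yield a symmetry data vector to classify.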

In the above cases in which the symmetry 
data vector satisfies the compatibility relations 
(SEBR, NLC, and LCEBR), a finer topological 
classification can be obtained using stable and 
fragile topological SIs. First, in the presence 
of non-negligible SOC, nonmagnetic SEBR 
and NLC bands correspond to stable TIs and 
TCIs with anomalous surface Dirac cones and 
helical hinge states (16, 17, 30). Conversely, as 
shown in (29, 31, 34, 36) and detailed in text
S6, in the absence of SOC and magnetism, 
bands with nontrivial stable SIs correspond 
to TSM (specifically SEBR-SM and NLC-SM) 
phases with bulk nodal degeneracies in the 
Brillouin zone (BZ) interior and topological 
surface Fermi arcs or flat band-like surface 
and intrinsic hinge states. Lastly, it was re- 
cently discovered that LCEBR bands with or 
without SOC may in some cases be further 
classified as fragile topological (43, 44) through 
the fragile SIs elucidated in (45, 46). For all 
possible classes of symmetry-indicated sta- 
ble and fragile TI, TCI, and TSM phases in 
the presence of SOC, we provide a detailed 
enumeration of the most idealized material 
realizations across all stoichiometric materials
in the ICSD (text S11). Although we find that
there do not exist materials in which the 
entire valence manifold is fragile topological, 
we do find numerous examples of experimen- 
tally accessible materials with well-isolated 
fragile bands close to E_F (see text S11E).

We begin by summarizing the symmetry- 
indicated topological properties of the stoi- 
chiometric materials in the ICSD. First, we 
discuss statistical trends uncovered across 
the materials analyzed in this work, includ- 
ing the distribution of topological features 
at E_F and statistics for SOC-driven topological
phase transitions, which are defined by comparing
the stable topology at E_F for each ICSD entry in
calculations performed with and without
incorporating the effects of SOC. We then
highlight material candidates in which our
investigations have uncovered previously un- 
recognized topological features at experimen- 
tally accessible energies. 


Material statistics 


Fig. 2. Topology of the stoichiometric materials in the ICSD. (A) Distribution of the topology at E_F with
SOC among the 38,298 stoichiometric unique materials in the ICSD as defined in text S4, subdivided by
crystallographic point group (PG). For each of the 32 PGs of the 230 space groups (56), we list the number of
symmetry-indicated stable topological (crystalline) insulators (NLC- or SEBR-classified TIs and TCIs), TSMs
(ES or ESFD), or unique materials with trivial symmetry-indicated topology (LCEBR). (B) The distribution of
the topology at E_F across all PGs and topological classes (37, 38) (text S2). Of the unique materials at E_F,
16.00% are TIs or TCIs and 36.65% are TSMs, implying that a remarkable 52.65% of the stoichiometric materials
in nature are topological at intrinsic filling. (C) Distribution of the symmetry-indicated band topology away
from E_F across the stoichiometric unique materials in the ICSD. An overwhelming 87.99% of materials contain
at least one symmetry-indicated stable or fragile topological band across the energy range of our
first-principles calculations (see text S4 for calculation details), which is even more notable when
considering that many materials (e.g., noncentrosymmetric crystals) have SGs without either stable or fragile
SIs [see (1, 16, 17, 29-36, 45, 46)]. Because previous works have demonstrated the existence of
non-symmetry-indicated TI and TCI phases in materials with trivial stable SIs (1, 3, 13, 14, 34), the
percentage of materials in nature with topological bands at and away from E_F is necessarily even larger than
87.99%, suggesting an intriguing direction for future study. In addition to the repeat-topological and
supertopological materials (Fig. 4 and text S9), we have also discovered the existence of supermetallic
(SMetal) materials, in which all of the bands above the core shell are connected up to at least a filling of
2N_e, where N_e is the number of valence electrons (texts S6 and S8). Although SMetal materials are relatively
rare in the presence of SOC [17 unique materials; see (C)], there are 1138 unique SMetal materials without
SOC, which is consistent with the general trend of increased band connectivity in materials when neglecting
the effects of SOC (Table 1 and texts S6, S8, and S10). In text S8, we provide further detailed statistics for
the symmetry-indicated band topology at and away from E_F across all of the stoichiometric materials in the
ICSD.

At the time of our investigations, the ICSD (41)
contained 193,426 entries, of which 96,196
characterized stoichiometric chemical compounds
with processable (noncorrupt) CIF structure
files. We performed first-principles calculations
on all 96,196 processable stoichiometric entries,
resulting in convergent electronic structures for
73,234 entries in the presence of SOC, which we
grouped into 38,298 unique materials by common
chemical formulas and stable topology at E_F (see
text S3A). In Fig. 2A, we show the distribution of
the topology at E_F for the unique materials
within each of
the 32 crystallographic point groups of the 
230 nonmagnetic SGs, and in Fig. 2B, we show 
the total number of unique materials in each 
topological class. We find that at intrinsic 
filling, 6128 unique materials are symmetry- 
indicated stable TIs or TCIs, and 14,037 unique 
materials are TSMs. Our findings represent a 
doubling of the number of known topologi- 
cal materials from those identified in pre- 
vious high-throughput calculations (38-40). 
Furthermore, we find that 52.65% of unique 
materials are symmetry-indicated stable TIs, 
TCIs, or TSMs at E_F, which also represents a
doubling from previous estimates of the per- 
centage of 3D topological materials in nature 
(38, 39). This percentage is considerably 
higher than the percentage of topological 
stoichiometric 2D materials, which has previ- 
ously been computed to lie within the range of 
a few to 20% (61, 62). 
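
As a quick consistency check, the headline percentages follow directly from the counts quoted in this section. The sketch below uses only numbers stated in the text; the variable and function names are illustrative.

```python
# Counts quoted in the text
icsd_entries = 193_426          # all ICSD entries at the time of the study
processable_entries = 96_196    # stoichiometric entries with processable CIF files
convergent_entries = 73_234     # convergent electronic structures with SOC
unique_materials = 38_298       # unique materials after grouping
stable_ti_tci = 6_128           # symmetry-indicated stable TIs or TCIs at E_F
tsm = 14_037                    # topological semimetals at E_F


def percent(part, whole):
    """Percentage of `whole` represented by `part`, rounded to two decimals."""
    return round(100 * part / whole, 2)


topological_at_ef = stable_ti_tci + tsm
share_ti_tci = percent(stable_ti_tci, unique_materials)
share_tsm = percent(tsm, unique_materials)
share_topological = percent(topological_at_ef, unique_materials)
```

The three shares reproduce the 16.00%, 36.65%, and 52.65% figures reported for the unique materials at intrinsic filling.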

In texts S5 and S8, we respectively provide 
detailed statistics for the computational re- 
sources expended for the calculations per- 
formed for this work, and for the topology at 
and away from E_F across the stoichiometric
materials in the ICSD. In the presence of 
SOC, we find that an overwhelming 87.99% 
of unique materials contain at least one 
symmetry-indicated stable or fragile topo- 
logical band (Fig. 2C). This percentage is all 
the more surprising, as we have included mate- 
rials whose SGs have trivial stable SI groups 
[e.g., noncentrosymmetric crystals without 
high-fold rotoinversion symmetries (16, 17, 29)], 
such that all isolated bands in the SG are 
classified by SIs as trivial or fragile. Taking all 
38,298 unique materials into consideration— 
including the materials in SGs with trivial 
stable SI groups (16, 17, 29)—we find that 
nearly two-thirds of all energetically isolated 
bands in nature exhibit symmetry-indicated 
stable topology. Hence, our investigations 
reveal that even away from E_F, the majority
of electronic features in solid-state materials 
can only be robustly modeled by incorporating 
topological band theory. 

Lastly, we have also computed detailed 
statistics for the relative topology of each 
stoichiometric ICSD entry with and without 
incorporating the effects of SOC. As discussed 
above and in text S10 and demonstrated in 
(29, 31, 34, 36), all symmetry-indicated, non- 
magnetic stable topological phases without 
SOC are TSMs. Hence, in the absence of SOC, 
there are ESFD- and ES-classified TSMs—in 
which the nodal degeneracies respectively lie 
at high-symmetry points (ESFD) and along 
high-symmetry lines and planes (ES)—as well 
as SEBR-SM- and NLC-SM-classified TSMs, 
in which the bands along all high-symmetry 
lines and planes satisfy the insulating com- 
patibility relations (28, 58) and the nodal 
degeneracies lie in the BZ interior. In Table 1,
we show the number of unique materials in 
each topological class with and without SOC. 
Table 1 indicates that the majority of ES-, 
NLC-SM-, and SEBR-SM-classified TSMs with- 
out SOC become stable TIs and TCIs (NLC- and 
SEBR-classified) when SOC is introduced, 
whereas ESFD-classified TSMs without SOC 
overwhelmingly remain TSMs (ES- and ESFD- 
classified) when the effects of SOC are in- 
corporated. In text S10, we provide complete 
statistics for the SOC-driven topological phase 
transitions at E_F in the ICSD; in text S11D,
we identify representative materials for each 
class of SOC-driven TSM-insulator transition 
in Table 1. 


Material candidates 


We next highlight the materials in which our 
investigations have revealed or provided a 
different context for topological features 
within an experimentally accessible range 
from E_F; an extensive tabulation of the most
idealized realizations in nature of all previously 
established classes of symmetry-indicated TI, 
TCI, and TSM phases in the presence of SOC 
is further provided in text S11. To begin, in 
Fig. 3, we present experimentally favorable 
material candidates with stable and fragile 
topology at E_F. First, in Fig. 3A, we show the 
electronic band structure of MoGe2 [ICSD 
76139, SG 139 (I4/mmm)], a higher-order-topological, 
centrosymmetric TSM (59) in 
which the doubly degenerate valence and 
conduction bands that meet along ΓM in 
fourfold Dirac points at E_F, along with the 
next-highest doubly degenerate valence bands, 
are as a set fragile topological (i.e., the two 
conduction bands and four valence bands closest 
to E_F in Fig. 3A as a set exhibit nontrivial fragile 
SIs). Next, in Fig. 3B, we highlight HgBa2CuO4 
[ICSD 75720, SG 123 (P4/mmm)], a well-studied 
high-temperature cuprate superconductor 
(63), which we find to be an ideal 
ESFD-classified metal with a doubly degenerate, 
half-filled band at E_F. Then, in Fig. 3, C 
and D, we respectively show the band structures 
of the transition metal chalcogenides (TMCs) 
TaSe2 [ICSD 24313, SG 164 (P-3m1)] and TiS2 
[ICSD 72042, SG 227 (Fd-3m)]. Both TaSe2 and 
TiS2 host well-established 2D charge density 
wave phases (64-66); in this work, we discover 
that the valence bands extending to ~3 eV 
below E_F in 3D TaSe2 are fragile topological, as 
are the conduction bands extending to ~1.5 eV 
above E_F in 3D TiS2. 

In Fig. 3, E and F, we respectively show 
the electronic band structures of the closely 
related TMCs Ta2NiSe5 [ICSD 61148, SG 15 
(C2/c)] and Ta2NiSe7 [ICSD 61352, SG 12 
(C2/m)], which show particular promise for 
experimental investigations owing to their 
relatively simple normal-state Fermi surfaces. 
Ta2NiSe5 and Ta2NiSe7 belong to a larger 
family of materials previously highlighted 
for hosting metal-insulator transitions and 
strongly correlated phases of matter. Previous 
investigations have demonstrated that the 
layered TMC Ta2NiSe5 hosts an exotic 
exciton-insulator phase (67, 68). By comparing the 
parity (inversion) eigenvalues of Ta2NiSe5 
with and without incorporating the effects 
of SOC (Fig. 4, A and B), we find that the 
narrow-gap semiconducting state of Ta2NiSe5 
in fact realizes a 3D TI phase originating from 
weak, SOC-driven band inversion at the Γ 
point. Unlike Ta2NiSe5, Ta2NiSe7 (Fig. 3F) is 
instead a quasi-1D TMC and has been 
previously highlighted for exhibiting a charge 
density wave instability (69). Ta2NiSe7 is 
closely related to the ES-classified, structurally 
chiral, quasi-1D TMC Weyl semimetal Ta2ISe8 
Table 1. SOC-driven topological phase transitions in the unique stoichiometric materials in 
the ICSD. We show the statistics for the band topology at E_F for the unique stoichiometric materials 
in the ICSD with and without incorporating the effects of SOC in first-principles calculations. Rows 
list the topological classification at E_F in the absence of SOC; columns list the topological classification 
at E_F in the presence of SOC. We find that nearly two-thirds of ES-classified unique materials without 
SOC become stable TIs and TCIs with SOC, whereas most ESFD TSMs without SOC remain 
semimetallic when SOC is incorporated. Conversely, more than 85% of NLC-SM- and SEBR-SM-classified 
TSMs without SOC [see (31, 34, 36) and text S6] become stable TIs and TCIs after 
incorporating SOC. Complete statistics for the SOC-driven topological phase transitions at E_F in the 
ICSD are provided in text S10, and representative materials for each class of TSM-insulator phase 
transition are provided in text S11D. 
Fig. 3. Material candidates with topological features at E_F. We highlight 
materials with previously unidentified or recontextualized topological 
features at E_F. In all panels, bands with symmetry-indicated stable and trivial 
topology are plotted in black, symmetry-indicated fragile valence bands in 
blue, and symmetry-indicated fragile conduction bands in red. (A) MoGe2 
[ICSD 76139, SG 139 (I4/mmm)] is a higher-order-topological, centrosymmetric 
Dirac TSM (59) in which the doubly degenerate valence and conduction 
bands that meet along ΓM in fourfold Dirac points at E_F, along with the 
next-highest doubly degenerate valence bands, are as a set fragile topological. 
(B) HgBa2CuO4 [ICSD 75720, SG 123 (P4/mmm)], an established high-Tc 
superconductor (63), is found to be an ideal ESFD-classified metal with 
a doubly degenerate, half-filled band at E_F. (C and D) In the transition-metal 
chalcogenides (TMCs) TaSe2 [ICSD 24313, SG 164 (P-3m1)] and TiS2 [ICSD 
72042, SG 227 (Fd-3m)], previously determined to host 2D charge density wave 
phases (64-66), the highest valence and lowest conduction bands are 
respectively fragile topological. (E) Ta2NiSe5 [ICSD 61148, SG 15 (C2/c)] is 
a layered TMC that has been demonstrated to host an exciton-insulator phase 
(67, 68); we find that the narrow-gap semiconducting state of Ta2NiSe5 is in 
fact a 3D TI. (F) Conversely, the closely related quasi-1D TMC Ta2NiSe7 [ICSD 
61352, SG 12 (C2/m)], which we find to also be a 3D TI, exhibits a charge 
density wave instability (69). In Fig. 4, we provide further analysis of the band 
ordering in Ta2NiSe5 and Ta2NiSe7. 
Fig. 4. We analyze the band ordering of Ta2NiSe5 [ICSD 61148, SG 15 (C2/c)] and 
Ta2NiSe7 [ICSD 61352, SG 12 (C2/m)] with and without SOC. In all panels, the 
red + and − symbols respectively indicate Kramers pairs of states with positive 
and negative parity (inversion) eigenvalues. (A and B) The electronic band 
structure of Ta2NiSe5 respectively neglecting and incorporating the effects of 
SOC. Without SOC (A), Ta2NiSe5 exhibits trivial symmetry-indicated stable 
topology at E_F (i.e., Ta2NiSe5 is LCEBR-classified without SOC). When SOC is 
reintroduced in (B), it drives a band inversion at Γ, transitioning Ta2NiSe5 into a 
Z_4 = 3 3D TI phase. (C and D) The electronic band structure of Ta2NiSe7 
respectively neglecting and incorporating the effects of SOC. Unlike in Ta2NiSe5 
(A), the bands at Γ and A (k_{x,y} = 0, k_z = 0, π, respectively) in Ta2NiSe7 (C) are 
already inverted by orbital coupling in the xy- (ab-) plane, taking the Ta2NiSe7 
chains to be oriented along the z- (c-axis) direction (see ICSD 61352). Hence, 
without SOC, Ta2NiSe7 realizes an ES-classified nodal-line-semimetal phase with 
the same parity eigenvalues as a z-directed weak TI [(Z_{2,1}, Z_{2,2}, Z_{2,3}, Z_4) = (0012) 
when subduced onto SG 2 (P-1) (text S6B) (5, 30, 31, 36)]. When SOC is 
reintroduced, the nodal lines at E_F in Ta2NiSe7 become gapped, the band 
inversion at A is removed by SOC, and the bands at Γ remain inverted, overall 
driving Ta2NiSe7 into a Z_4 = 3 3D TI phase. 


[ICSD 35190, SG 97 (I422); see text S11C], 
which was shown in recent experimental work 
to exhibit a topological (axionic) charge 
density wave that competes with a superconducting 
phase under applied pressure 
(26, 27, 70). In this work, we find that the 
room-temperature phase of Ta2NiSe7 is a 3D 
TI originating from a combination of anisotropic 
orbital coupling and SOC-driven band 
inversion. Specifically, we find that without 
SOC, Ta2NiSe7 is an ES-classified nodal-line 
semimetal with band inversions at k_{x,y} = 0, 
k_z = 0, π (respectively the Γ and A points in 
Fig. 4C) driven by orbital coupling in the xy- 
(ab-) plane [taking the Ta2NiSe7 chains to be 
oriented along the z- (c-axis) direction; see 
ICSD 61352]. This can be seen by recognizing 
that Ta2NiSe7 without SOC (Fig. 4C) exhibits 
the same parity eigenvalues as a z-directed 
weak TI [(Z_{2,1}, Z_{2,2}, Z_{2,3}, Z_4) = (0012) when 
subduced onto SG 2 (P-1); see text S6B and 
(5, 30, 31, 36)]. When SOC is reintroduced, the 
nodal lines at E_F in Ta2NiSe7 become gapped, 
and the band inversion at A is removed, whereas 
the bands at Γ remain inverted (see Fig. 4D). 
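
The parity bookkeeping behind the (0012) label can be illustrated with a minimal sketch. It assumes one common convention for inversion-symmetric crystals: the weak indices are the number of odd-parity occupied Kramers pairs on each k_i = π plane taken modulo 2, and Z_4 is the total number of odd-parity occupied pairs over the eight time-reversal-invariant momenta taken modulo 4. Sign conventions differ between references, so a single inversion may be reported as Z_4 = 1 or Z_4 = 3. The function name, the momentum labels, and the toy occupation below are hypothetical and are not taken from the calculations in this work.

```python
from itertools import product

# Eight time-reversal-invariant momenta (TRIMs), in units of pi along the
# three primitive reciprocal axes: (0, 0, 0) is Gamma, (0, 0, 1) has k_z = pi.
TRIMS = list(product((0, 1), repeat=3))


def inversion_indicators(odd_pairs):
    """Return (Z_{2,1}, Z_{2,2}, Z_{2,3}, Z_4) from odd-parity pair counts.

    odd_pairs maps each TRIM to the number of occupied Kramers pairs with
    parity eigenvalue -1 (assumed convention; other sign choices exist).
    """
    missing = [k for k in TRIMS if k not in odd_pairs]
    if missing:
        raise ValueError(f"missing TRIM points: {missing}")
    # Z_4: all odd-parity pairs, counted modulo 4
    z4 = sum(odd_pairs[k] for k in TRIMS) % 4
    # Weak index i: odd-parity pairs on the k_i = pi plane, modulo 2
    weak = tuple(
        sum(odd_pairs[k] for k in TRIMS if k[i] == 1) % 2 for i in range(3)
    )
    return weak + (z4,)


# Hypothetical toy occupation: starting from an all-even reference, one pair
# is inverted at (0, 0, 0) and one at (0, 0, pi), mimicking inversions at
# k_x = k_y = 0 and k_z = 0, pi.
toy = {k: 0 for k in TRIMS}
toy[(0, 0, 0)] = 1
toy[(0, 0, 1)] = 1
print(inversion_indicators(toy))  # (0, 0, 1, 2)
```

In this toy setting, two inversions stacked along k_z give the weak-TI pattern (0012); removing one of them leaves an odd Z_4, which is the strong-TI case in this convention.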

Finally, we performed a high-throughput 
search for topological bands away from E_F. 
Our calculations have revealed the existence of 
two additional classes of topological materials: 
RTopo materials with stable topological 
insulating gaps at and just below E_F (Fig. 5A and 
text S9A), and STopo materials in which every 
energetically isolated set of bands above the 
core shell exhibits symmetry-indicated stable 
topology (Fig. 5B and text S9B). From a physical 
perspective, STopo phases result from a 
combination of EBR splitting, band backfolding 
from unit cell enlargement, and favorable 
hopping parameters; hence, the identification 
of an STopo phase in a particular material can 
be sensitive to sample and calculation details. 
In text S9B, we highlight the example of the 
experimentally established higher-order TI 
rhombohedral bismuth [ICSD 64703, SG 166 
(R-3m)] (22), for which some ICSD entries (e.g., 
ICSD 64703) are STopo, whereas others (e.g., 
ICSD 53797) are not. Crucially, Bi (ICSD 64703) 
and Bi (ICSD 53797) differ only by weak band 
inversion away from E_F, and both exhibit 
topologically nontrivial gaps at E_F and just 
below E_F. This provides a physical motivation 
for introducing a second topological class, 
RTopo, for materials like Bi (ICSD 64703). In 
idealized RTopo materials, there exist two sets 
of surface or hinge states within consecutive 
bulk gaps that both lie at energies accessible to 
ARPES probes without doping (~1.5 eV below 
E_F), a property that is crucially insensitive to 
band inversions at experimentally inaccessible 
energies far from E_F. The repeated topological 
surface and hinge states of RTopo materials 
are analogous to the Fermi-arc “quantum 
ladder” recently observed in the unconventional 
chiral semimetal alloy Rh_xNi_ySi (71) 
(see text S9A). Lastly, we emphasize that 
in principle, the STopo and RTopo classification 
of materials can be extended from 
stable topological states to fragile topology; 
however, we did not in this study encounter 
any ICSD entries with super- or repeat-fragile 
topology. 

Fig. 5. Repeat-topology and supertopology in Bi2Mg3. (A) A schematic depiction of a 
repeat-topological (RTopo) band structure. In RTopo materials, the gap at the Fermi level, as well 
as the next gap below E_F as measured by band connectivity through TQC (text S2), exhibit 
cumulative symmetry-indicated stable topology (text S9A). By this definition, which is motivated by 
the experimental accessibility of topological gaps and boundary states below E_F, we note that 
it is possible for the isolated bands between E_F and the next-highest gap below E_F to be 
topologically trivial. (B) A schematic depiction of a supertopological (STopo) band structure. In 
STopo materials, every energetically isolated set of bands in the spectrum exhibits 
symmetry-indicated stable topology (although the system at E_F is still free to be a TSM or exhibit 
cumulative trivial topology; see text S9B). (C) Bulk band structure of Bi2Mg3 [ICSD 659569, 
SG 164 (P-3m1)], an experimentally established 3D TI (72) revealed by our investigations to 
additionally be RTopo and STopo. Th[...] Z_4 [see (16, 17, 29, 30, 33, 34) and text S11 for the 
physical meaning of the stable SIs in SG 164 (P-3m1)]. (D) (0001)-surface states of Bi2Mg3 obtained from 
surface Green's functions (see text S9A for calculation details). We have labeled the surface states 
using the cumulative stable SIs of the projected bulk gaps (as determined by band connectivity; see 
table S2 for details). Previous ARPES investigations of Bi2Mg3 have revealed the existence of 
“surface resonance bands” below E_F at the (0001)-surface Γ point (72). We find that the surface 
resonance bands in Bi2Mg3 are in fact RTopo Dirac-cone surface states in the first gap below 
E_F (the “0013 RTopo gap”). The surface Green's function calculations exhibit additional 
twofold-Dirac cone surface states within the projected bulk gap ~0.8 eV below E_F (the “0010 WTI gap”). 
Although the earlier ARPES experiments also observed surface states in the projected 0010 gap in 
Bi2Mg3 (72), we emphasize that the surface states in the 0010 gap in (D) are either trivial 
or originate from non-symmetry-indicated stable topology, because the cumulative stable SIs (0010) 
characterize an (obstructed) weak TI phase that does not generically exhibit anomalous 
twofold-Dirac cones on z-normal [(0001)-] surfaces (see text S11A6). 

In Fig. 5, C and D, we show the bulk and 
surface states of the RTopo and STopo material 
Bi2Mg3 [ICSD 659569, SG 164 (P-3m1)]. 
Previous theoretical investigations and spectroscopic 
experiments have determined Bi2Mg3 
to be a 3D TI with “surface resonance bands” 
below E_F at the (0001)-surface Γ point (72). 
We find that the surface resonance bands 
detected in (72) are in fact the RTopo Dirac-cone 
surface states of Bi2Mg3 in the first gap 
below E_F (the “0013 RTopo gap” in Fig. 5D). In 
texts S11B and 9C, respectively, we provide 
detailed tabulations of RTopo and STopo materials 
in the ICSD. Taken together, the large 
number of experimentally accessible materials 
in the ICSD with stable topological bands 
away from E_F (see text S8), as well as the 
relative preponderance of RTopo and STopo 
materials, suggest that many of the surface 
resonances detected in previous ARPES investigations 
may in fact be topological surface 
states protected by spectral flow below E_F. 
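
The RTopo and STopo definitions above can be restated as a short sketch. It assumes that each energetically isolated set of bands (core shell excluded) has already been assigned a stable symmetry indicator written as (Z_{2,1}, Z_{2,2}, Z_{2,3}, Z_4), that indicators add componentwise modulo (2, 2, 2, 4) when band sets are stacked, and that the cumulative indicator of a gap is the sum over all listed sets below it. The function names and the toy indicator values are hypothetical and do not correspond to any ICSD entry.

```python
MODULI = (2, 2, 2, 4)          # assumed indicator group Z2 x Z2 x Z2 x Z4
TRIVIAL = (0, 0, 0, 0)


def add_si(a, b):
    """Add two symmetry indicators componentwise, modulo MODULI."""
    return tuple((x + y) % m for x, y, m in zip(a, b, MODULI))


def cumulative_si(band_sets, n_sets):
    """Cumulative indicator of the gap above the lowest n_sets band sets."""
    total = TRIVIAL
    for si in band_sets[:n_sets]:
        total = add_si(total, si)
    return total


def classify(band_sets, n_filled):
    """Flag RTopo and STopo for band sets ordered from low to high energy.

    band_sets: one indicator per isolated set of bands.
    n_filled:  number of sets lying entirely below E_F.
    """
    # STopo: every isolated set carries a nontrivial stable indicator.
    stopo = all(si != TRIVIAL for si in band_sets)
    # RTopo: the gap at E_F and the next gap below it are both
    # cumulatively nontrivial.
    rtopo = (
        n_filled >= 2
        and cumulative_si(band_sets, n_filled) != TRIVIAL
        and cumulative_si(band_sets, n_filled - 1) != TRIVIAL
    )
    return {"RTopo": rtopo, "STopo": stopo}


# Hypothetical toy spectra (three isolated sets, two of them filled)
print(classify([(0, 0, 1, 0), (0, 0, 0, 3), (0, 0, 1, 2)], 2))
print(classify([(0, 0, 0, 1), (0, 0, 0, 0), (0, 0, 0, 1)], 2))
```

The second toy spectrum is RTopo but not STopo because the set just below E_F is trivial on its own while both gaps remain cumulatively nontrivial, the situation described in the Fig. 5A caption.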


Discussion 


We have performed a complete calculation 
of symmetry-indicated stable and fragile 
band topology at and away from E_F across 
all known stoichiometric, solid-state, nonmagnetic 
materials. We have discovered that 
more than half of the materials in nature are 
stable TIs and TSMs at E_F, and nearly two-thirds 
of bands across all materials exhibit 
symmetry-indicated stable topology. Our investigations 
raise several questions. First, there 
exist well-established classes of noncentrosymmetric 
TI and TCI phases that cannot 
be diagnosed through SIs. For example, depending 
on experimental or first-principles 
calculation details, WTe2 [ICSD 14348, SG 
31 (Pmn2_1)] can realize an LCEBR-classified 
TSM phase with tilted Weyl cones (73), or a 
non-symmetry-indicated twofold-rotation- 
anomaly TCI phase (34). Additionally, like 
stable topological phases with trivial SIs, there 
also exist non-symmetry-indicated fragile 
phases that evade the methods used in this 
work (59). It remains an important outstand- 
ing question whether computationally efficient 
methods beyond TQC and SIs can be for- 
mulated for the high-throughput discovery of 
non-symmetry-indicated topological materials. 
Several promising studies have recently intro- 
duced alternative, scalable methods for high- 
throughput topological materials identification 
(74, 75), suggesting a future in which an even 
larger percentage of materials in nature are 
recognized as topologically nontrivial. Fur- 
thermore, while we have focused our efforts 
on nonmagnetic topological materials, recent 
studies have expanded the methods of TQC 
and SIs to magnetic systems (29, 35), facilitat- 
ing the high-throughput identification of more 
than 100 magnetic materials with nontrivial 
topology at E_F (37). Following the results of 
this work, future investigations may be directed 
toward identifying nontrivial electronic band 
topology away from E_F in magnetic materials, 
in particular targeting bands in which the 
topology and magnetism are coupled and ex- 
perimentally tunable. 

Lastly, we note that during the final stages of 
preparing this work, the 3D TMCs Ta2Pd3Te5 
and Ta2Ni3Te5, which are closely related to 
the TMCs Ta2NiSe5 and Ta2NiSe7 discovered 
to be 3D TIs in this work, were theoretically 
predicted in (76) to be non-symmetry-indicated 
mirror TCIs. Experimental evidence of a 2D 
TI phase in Ta2Pd3Te5 monolayers was also 
reported in (77). 


Materials and methods 


The first-principles calculations in this work 
were performed using the Vienna Ab-initio 
Simulation Package (VASP) (78-80). We 
treated the interaction between the ion cores 
and the valence electrons using the projector 
augmented-wave method (81), and we used 
the generalized gradient approximation with 
the Perdew-Burke-Ernzerhof parameteriza- 
tion for the exchange-correlation potential 
(82). For each material calculation, we used as 
input the structural parameters reported on 
the ICSD (42). For calculations incorporating 
the effects of SOC, we used the second 
variation method 
(83). For the self-consistent calculations (SCC) 
of the charge density and the density of states, 
we used a grid of 11 × 11 × 11 k points centered 
at the Γ point (k = 0). For the band structure 
calculations along high-symmetry lines and 
planes, we then used the input from the SCC. 
Each segment of the k path contained 20 k 
points. At each k point along each path segment, 
we specifically calculated the energies 
of at least 2N_e Bloch states, where N_e is the 
number of valence electrons in the primitive 
cell; we did not include bands originating 
from core-shell atomic orbitals. To identify 
the symmetry-indicated stable and fragile 
topological bands in each analyzed material, 
we then used the VASP2Trace (54, 84) program 
and an implementation of the Check Topo- 
logical Mat (55) program updated for this 
work (see text S3 for details). Lastly, to compute 
the (0001)-surface spectrum of Bi2Mg3 in 
Fig. 5D, we used WANNIER90 (85) to con- 
struct a Wannier-based tight-binding model 
from the s and p orbitals of Mg and the p 
orbitals of Bi, which we found to accurately 
reproduce the bulk electronic band structure 
of Bi2Mg3 (see text S9A for further calculation 
details). 
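
The sampling parameters quoted above can be made concrete with a small sketch. It only illustrates the bookkeeping (a Γ-centered 11 × 11 × 11 grid, 20 k points per path segment, and at least 2N_e bands); it is not the VASP input used in this work. The helper names are hypothetical, the grid is written in fractional reciprocal coordinates, and the sketch assumes that the 20 points on a segment include both endpoints.

```python
from itertools import product


def gamma_centered_grid(n=11):
    """Fractional coordinates of an n x n x n grid that contains k = 0."""
    return [(i / n, j / n, l / n) for i, j, l in product(range(n), repeat=3)]


def path_segment(start, end, n_points=20):
    """Evenly spaced k points from start to end (endpoints included)."""
    return [
        tuple(s + (e - s) * t / (n_points - 1) for s, e in zip(start, end))
        for t in range(n_points)
    ]


def min_bands(n_valence_electrons):
    """Minimum number of Bloch states to compute, 2 * N_e."""
    return 2 * n_valence_electrons


grid = gamma_centered_grid()
segment = path_segment((0.0, 0.0, 0.0), (0.5, 0.0, 0.0))
print(len(grid), len(segment), min_bands(16))  # N_e = 16 is a made-up value
```

An odd grid dimension such as 11 keeps Γ as the only grid point on the zone-center axes, and the 2N_e rule guarantees that, with SOC, at least as many conduction states as valence states are available for the band-connectivity analysis.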
NEURODEVELOPMENT 


Oligodendrocyte precursors guide interneuron 
migration by unidirectional contact repulsion 


Fanny Lepiemme, Julie Stoufflet, Miriam Javier-Torrent, Gabriel Mazzucchelli, Carla G. Silva, Laurent Nguyen 


INTRODUCTION: The cerebral cortex is an evo- 
lutionarily advanced brain region made of cel- 
lular layers tangentially organized into areas 
that serve higher cognitive functions. During 
development, most cortical interneurons (here- 
after called interneurons) are born in the ventral 
forebrain, where some progenitors concomi- 
tantly generate oligodendrocyte precursor cells 
(OPCs). OPCs differentiate into oligodendro- 
cytes, the principal functions of which are to 
wrap axons with myelin to support the rapid 
saltatory conduction of action potentials and 
to provide metabolic support to neurons in the 
postnatal brain. Three distinct populations of 
OPCs have been identified in the forebrain. 
These cells are born as successive waves with a 
defined spatiotemporal pattern, and they all 
migrate from their birthplace to colonize the 
brain. The two initial waves are born in the 
ventral forebrain and named vOPCs. They mi- 
grate together with the interneurons toward 
the cortex, whereas the third OPC wave is gen- 
erated locally around birth by some progenitors 
of the cortical wall. Despite their distinct origin, 
all OPC lineages converge at the transcrip- 
tional level, which may reflect their contribu- 
tion to myelination. 


RATIONALE: While migrating, some neural cells 
promote transient cellular interactions that 
confer upon them roles additional to those 
played once integrated into the cortical net- 
work. Accordingly, recent studies have reported 
that some first-wave vOPCs establish synaptic 
contacts with lineage-related interneurons dur- 
ing early postnatal periods. It is intriguing that 
a large fraction of the first-wave vOPCs do not 
contribute to synaptogenesis and are elimi- 
nated during the second postnatal week. Here, 
we assessed whether vOPCs play additional 
noncanonical functions during brain develop- 
ment by testing whether they cross-talk with 
interneurons to support their concomitant 
migration to the cerebral cortex. 


RESULTS: We hypothesized that some early 
functions of vOPCs would rely on their spatiotemporal 
origin, further defining their field 
of possible interactions with neighboring cells. 
Despite being born in shared germinal regions 
of the ventral forebrain, vOPCs and interneurons 
occupy mutually exclusive territories while 
migrating. Live imaging of both cell populations 
in brain slices from mouse embryos 
showed that first-wave vOPCs and interneurons 
use distinct migration strategies to reach the 
cortex. Cortical interneurons navigate into 
organized streams within the parenchyma, 
whereas first-wave vOPCs prefer migration 
along blood vessels. The ordered migration of 
interneurons in cortical streams relies mostly 
on gradients of chemoattractant molecules 
such as Cxcl12, to which interneurons are 
responsive through their expression of the 
Cxcr4 receptor. This chemokine is also abun- 
dantly released by endothelial cells covering 
the complex and dense network of growing 
blood vessels that perfuse the ganglionic eminences 
and later the cortical wall. A long-standing 
question in the field of brain development 
is how cortical interneurons cope 
with different sources, and thus gradients of 
Cxcl12, to properly navigate within the cortex. 
Here, we show that first-wave vOPCs, which 
are also attracted by Cxcl12, migrate along 
blood vessels and promote unidirectional contact 
repulsion to repel migrating interneurons 
from blood vessels. This cellular mechanism, 
which is not shared by second-wave vOPCs, 
relies on the expression of semaphorins 6a/6b 
by first-wave vOPCs, which bind and activate 
the plexin A3 receptors expressed at the 
surface of interneurons. This atypical signaling 
triggers interneuron polarity reversal, preventing 
them from clustering around blood 
vessels and allowing them to follow the cortical 
Cxcl12 gradient (released by intermediate 
progenitors and meningeal cells) to reach and 
settle in their cortical layer in a timely manner. 
Moreover, a prolonged interaction of migrating 
interneurons with cells that release high 
levels of Cxcl12, such as endothelial cells, may 
not only bias their migration directionality but 
also reduce their motility through a sustained 
activation of Cxcr4 receptors by Cxcl12, leading 
to their internalization. 

Unidirectional contact repulsion provides guidance 
to migrating interneurons. During embryogenesis, 
interneurons and first-wave vOPCs are attracted by the 
Cxcl12 released by blood vessels. By migrating along 
these structures, first-wave vOPCs prevent contact 
between interneurons and blood vessels through 
unidirectional contact repulsion. This mechanism provides 
guidance to migrating interneurons and relies on 
the interaction between Sema6a/6b ligands and Plxna3 
receptors expressed by first-wave vOPCs and interneurons, 
respectively. LGE, lateral ganglionic eminence; 
MGE, medial ganglionic eminence; POA, preoptic area; 
UCoRe, unidirectional contact-repulsion event. 


CONCLUSION: During brain development, vOPCs 
and interneurons are born in the ventral fore- 
brain and migrate concomitantly along tan- 
gential routes to reach the cerebral cortex. Here, 
we show by coupling histological analysis of 
mouse genetic models with live imaging that, 
while being responsive to Cxcl12 gradients, both 
cell populations occupy mutually exclusive 
forebrain territories enriched in this chemokine. 
Live-imaging analyses demonstrated that 
first-wave, but not second-wave, vOPCs per- 
form unidirectional contact repulsion on inter- 
neurons. This mechanism steers interneurons 
away from blood vessels that release Cxcl12, 
thereby allowing them to follow cortical gra- 
dients of this chemokine to later settle in their 
cortical layer. This mode of contact repulsion 
is specific to first-wave vOPC-interneuron pairs 
and distinct from self-repulsion. It relies on 
the activation of an atypical semaphorin-plexin 
signaling that induces directional change of 
interneurons upon their polarity reversal. 
Whether the specificity of this interaction 
relies on the degree of interneuron maturation 
and/or the signaling toolbox of vOPCs 
belonging to two distinct waves remains to 
be determined. 
NEURODEVELOPMENT 


Oligodendrocyte precursors guide interneuron 
migration by unidirectional contact repulsion 


Fanny Lepiemme, Julie Stoufflet, Miriam Javier-Torrent, Gabriel Mazzucchelli, Carla G. Silva, Laurent Nguyen 


In the forebrain, ventrally derived oligodendrocyte precursor cells (vOPCs) travel tangentially toward the 
cortex together with cortical interneurons. Here, we tested in the mouse whether these populations 
interact during embryogenesis while migrating. By coupling histological analysis of genetic models 
with live imaging, we show that although they are both attracted by the chemokine Cxcl12, vOPCs and 
cortical interneurons occupy mutually exclusive forebrain territories enriched in this chemokine. 
Moreover, first-wave vOPC depletion selectively disrupts the migration and distribution of cortical 
interneurons. At the cellular level, we found that by promoting unidirectional contact repulsion, first- 
wave vOPCs steered the migration of cortical interneurons away from the blood vessels to which they 
were both attracted, thereby allowing interneurons to reach their proper cortical territories. 


Oligodendrocytes differentiate from oligodendrocyte 
progenitor cells (OPCs) and 
populate the entire central nervous system. 
After birth, they progressively wrap 
axons to support rapid saltatory conduction 
of action potentials as well as to provide 
metabolic support to neurons through axo-myelinic 
channels (1). Some OPCs convert into 
neural/glial antigen 2 (NG2)-expressing glia, 
which is the most proliferative pool of cells in 
the mature brain (2). These cells also form syn- 
apses with neurons (3-5). OPCs from different 
central nervous system regions are genetically 
distinct, and several intermediates coexist dur- 
ing development (6), but some further con- 
verge into similar transcriptional states after 
birth (7). In the forebrain, OPCs have multi- 
ple origins and are generated in successive 
“spatiotemporal waves” that populate the cor- 
tex from embryonic development onward (8). 
The two first OPC waves are ventrally derived 
OPCs (vOPCs) that are produced in the ventral 
forebrain and are followed by a perinatal wave 
born in the cerebral cortex, which gradually 
expands and outnumbers the others (8). Al- 
though the second and third OPC waves mainly 
contribute to axon myelination, the function 
of the first-wave vOPC cohort remains elusive. 
This cell population was first thought to be 
transient and eliminated at birth (8), but de- 
spite undergoing massive cell death, recent 
single-cell analyses showed its maintenance 
and persistence in some brain regions during 
adulthood (7). Most first-wave vOPCs share 
common embryonic origins with cortical interneurons 
(hereafter referred to as interneurons). 
They are born from progenitors of the medial 
ganglionic eminence and the preoptic area, 
where genetic cross-inhibitory mechanisms 
control the balanced generation of interneu- 
rons and first-wave vOPCs. The expression of 
some Distal-less (Dlx) homeobox transcription 
factors drives interneuron fate by repressing 
the OPC gene Oligodendrocyte Transcription 
Factor 2 (Olig2) (9), whereas Oligodendrocyte 
Transcription Factor 1 (Olig1) expression promotes 
the specification of first-wave vOPCs 
and prevents the acquisition of an interneuron 
fate by repressing Dlx1/2 expression (10). 
Some first-wave vOPCs from the preoptic area 
(Dbx1+) or from the medial ganglionic eminence 
(Nkx2.1+) cluster with their lineage-related 
interneurons in the cortex of newborn 
mice, where they form transient synapses before 
engaging into myelination of specific local 
circuits (5, 11). During embryogenesis, vOPCs 
and interneurons migrate concomitantly into 
the cerebral cortex (8, 12), a process during 
which these cells may interact and cooperate. 
We tested this hypothesis by performing live 
imaging on organotypic forebrain slices from 
transgenic mice to analyze the distribution 
and dynamic behavior of vOPCs and interneu- 
rons. We first showed that, despite coexisting 
in time and space, these cells occupy different 
territories and invade the cortex using distinct 
migration modes and paths. The elimination 
of first-wave vOPCs from the early forebrain 
results in changes of interneuron distribution 
and migration, supporting the existence of a 
functional cross-talk between these two cell 
types that starts at the onset of corticogenesis. 
At the cellular level, we found that first- but 
not second-wave vOPCs repel interneurons by 
a process that we named unidirectional contact 
repulsion. This atypical behavior contributes 
to the guidance of interneurons in streams, 
preventing them from clustering around blood 
vessels. vOPCs and interneurons are both attracted 
by the chemokine Cxcl12 released by 
blood vessels. However, vOPCs are the only 
ones to move along blood vessels, where they 
also divide (13). Here, we show that the forebrain 
depletion of some first-wave vOPCs results 
in interneuron clustering around blood 
vessels by locally increasing Cxcl12 availability. 
Altogether, our data demonstrate that by in- 
ducing unidirectional contact repulsion, first- 
wave vOPCs play a transient role in supporting 
interneuron migration in the developing brain. 
Unidirectional contact repulsion is thus a strat- 
egy allowing first-wave vOPCs and interneu- 
rons, which are both responsive to a common 
chemoattractant, to progress in a coordinated 
fashion on shared forebrain territories. 


Results 
Forebrain spatial segregation of interneurons 
and vOPCs 


We analyzed the dynamic distribution pattern 
of first-wave vOPCs (platelet-derived growth 
factor receptor A, Pdgfra*) (8) and most inter- 
neurons (calbindin*) that migrate in the early 
forebrain (/4). At embryonic day 11.5 (E11.5), 
both cell types mostly distribute into nonover- 
lapping subpallial territories (Fig. 1, A to D). 
First-wave vOPCs populate the germinal zones 
of the medial ganglionic eminence/preoptic 
area, whereas interneurons are distributed 
throughout their subventricular and mantle 
zones (SVZ and MAZ, respectively) (Fig. 1, A 
and B). At E13.5, first-wave vOPCs accumu- 
late in the SVZ and along the MAZ against a 
stream of migrating interneurons (Fig. 1, E to 
H). Calbindin* interneurons born in the me- 
dial ganglionic eminence organize into a deep 
migratory stream (DMS), and those derived 
from the preoptic area form a superficial mi- 
gratory stream (SMS) (Fig. 1E) (15). Most first- 
wave vOPCs do not integrate these migratory 
streams, suggesting that during early embry- 
onic periods, first-wave vOPCs and interneu- 
rons are spatially segregated in the subpallium. 
This was further confirmed by the analysis of 
maximum fluorescence intensities and rela- 
tive number of cells expressing Pdgfra or cal- 
bindin at E115 (Fig. 1, C and D) and at E13.5 
(Fig. 1, G and H), supporting the spatial segre- 
gation of first-wave vOPC and interneuron 
populations in the ventral forebrain. We showed 
that vOPCs occupying the medial ganglionic 
eminence/preoptic area territory above the 
DMS mainly arise from Nkx2.1+ (87.72 ± 5.04%)
and Dbx1+ (31.23 ± 2.05%) progenitors (Fig. 1, I
to L). We next analyzed the distribution of 
first-wave vOPCs and interneurons in the cortical
wall of E13.5 embryos and followed their
migration in transgenic mice [Dlx5,6:Cre-GFP
for interneurons, and Sox10:loxGFP-STOPlox-DTA
(hereafter referred to as Sox10:GFP-DTA) for
vOPCs, which largely coexpress calbindin and
Pdgfra, respectively; fig. S1, A to C]. Although
interneurons migrate within two main tangential
streams located in the marginal zone
(MZ) and the intermediate zone (IZ), vOPCs
show erratic distribution within the cortical
wall (Fig. 1M and movies S1 and S2). At the
cellular level, both interneurons and vOPCs 
undergo saltatory migration, but vOPCs move
at a slower speed as a result of a reduced
nucleokinesis frequency and amplitude (fig. S1, D
to F) combined with longer pausing time of
their nucleus (fig. S1G). At E16.5, numerous
interneurons undergo intracortical dispersion, 
whereas most vOPCs accumulate under the 
cortical plate, where NCAM-L1+ thalamocortical
fibers extend (Fig. 1, N and O).


Depletion of vOPCs alters forebrain
interneuron distribution

We next wondered whether the spatial segre- 
gation of interneurons and first-wave vOPCs 
in the forebrain results from responsiveness to 
different cues or if it arises from their mutual 
interaction. To investigate these two possibilities, 
we crossed Pdgfra-CreERT and Sox10:GFP-DTA
mice to generate embryos devoid of first-wave 
vOPCs (OPC-depleted embryos) upon succes- 
sive (E11.5 and E12.5) intraperitoneal injections 
of tamoxifen in pregnant dams. The depletion 
of first-wave vOPCs was confirmed at E13.5 
and E14.5 (Fig. 2A and fig. S2, A and B). This 
procedure did not induce bystander cell death, 
changes in ganglionic eminence progenitor pro- 
liferation, or changes in interneuron numbers 
(fig. S2, C to I). The forebrain of E14.5
OPC-depleted embryos showed an enlarged DMS as a
result of an increased dispersion of calbindin+
interneurons (Fig. 2, B and C). Analysis of the
dorsal forebrain of E13.5 and E16.5 embryos 
showed a reduced number of calbindin+
interneurons in cortical streams upon first-wave
vOPC depletion (Fig. 2, D to G). The acute loss 
of first-wave vOPCs did not prevent later-born 
second-wave vOPCs from invading the cortical 
wall of OPC-depleted embryos at E16.5 (fig. S2, 
J and K). We next crossed OPC-depleted mice 
with Dlx5,6:Cre-GFP mice to generate green
fluorescent protein-positive (GFP+) cortical
interneuron (cIN)-OPC-depleted embryos and
followed the migration of GFP+ interneurons,
which included most future cortical interneuron
subtypes (Fig. 2A). The number of GFP+
interneurons in E13.5 GFP+cIN-OPC-depleted
embryos was reduced in the cortical wall,
suggesting that the loss of first-wave vOPCs affects
Fig. 2. Depletion of first-wave vOPCs alters interneuron distribution in the
forebrain. (A) Mouse breeding strategies to deplete Pdgfra+;Sox10+ first-wave vOPCs
through conditional expression of DTA upon tamoxifen injection (OPC-depleted)
and to visualize Dlx5,6-GFP+ interneurons (GFP+cIN-OPC-depleted). Immunolabeling
of brain slices from E13.5 siblings Pdgfra-CreERT (Control) and Pdgfra-CreERT;Sox10:
GFP-DTA (OPC-depleted) embryos shows complete loss of first-wave vOPCs
(green) in the medial ganglionic eminence/preoptic area after tamoxifen injection in
OPC-depleted embryos. Right, immunolabeling and quantification (n = 13 embryos,
100% of depletion). Nuclei were counterstained with DAPI. (B) Immunolabeling
of interneurons (red) in the subpallium of E14.5 control and OPC-depleted embryos.
Migrating interneurons are organized into a DMS stream, which is enlarged in
OPC-depleted embryo at three rostrocaudal levels. (C) Quantifications of DMS width
in its upper part at three rostrocaudal levels, as illustrated in (B) (n = 4 embryos;
****P < 0.0001, two-way ANOVA). (D) Immunolabeling of interneurons (red) in
the cortical wall of E13.5 control and OPC-depleted embryos. The number of
migrating interneurons in the MZ and IZ streams is decreased upon first-wave
vOPC depletion. (E) Quantification of the number of interneurons (red) in
rostrocaudal regions of the cortex of E13.5 control and OPC-depleted embryos
(n = 13 to 14 embryos; **P = 0.0024, ***P = 0.0004, ***P = 0.0003, respectively,
two-way ANOVA). (F) Immunolabeling of interneurons (red) in the cortical wall
of E16.5 control and OPC-depleted embryos. The number of interneurons (red) is
decreased in the CP. (G) Quantification of the number of interneurons (red) in
the rostrocaudal regions of the cortex of E16.5 control and OPC-depleted embryos
(n = 6 to 7 embryos; ****P < 0.0001, ***P = 0.0007, ****P < 0.0001, two-way
ANOVA). Scale bars: (A) and (D), 50 μm; (F), 100 μm; (B), 200 μm. GE,
ganglionic eminence.
the migration of most interneurons (fig. S3, A
and B). This was further supported by the ventral
accumulation of a cohort of 5-ethynyl-2’-deoxyuridine
(EdU)-positive interneurons born
at E11.5 in the forebrain of E13.5 OPC-depleted
embryos (fig. S3, C to E). Despite the progressive
increase of vOPC numbers generated by the
second wave in OPC-depleted cortex (where 
most first-wave vOPCs had been eliminated), 
the number of interneurons in the cortical 
wall remained decreased at E18.5, just before 
the death of embryos (fig. S3, H and I). This
defect seems specific to interneurons because 
cortical distribution of microglia, which mi- 
grate concomitantly with both interneurons 
and vOPCs, remained unchanged at E13.5 (fig.
S3, J and K). Moreover, depleting interneurons
by expressing diphtheria toxin subunit A (DTA)
in Dlx5,6+ cells (Dlx5,6:Cre-GFP;R26:DTA) had
no effect on the number or localization of 
vOPCs within the cortical parenchyma (fig. S3, 
L to N). These results contrast with those ob- 
served with OPCs generated dorsally around 
birth, the generation of which is impaired upon 
loss of interneurons (16). Altogether, these results
suggest that first-wave vOPCs selectively
influence the forebrain distribution and sorting 
of interneurons into the cortical wall. 


vOPCs prevent interneuron-blood
vessel interaction


During late development, vOPCs migrate in 
close association to blood vessels (13). We ob- 
served that during the early developmental 
stages, when the cortical wall is less vascular- 
ized, first-wave vOPCs migrate independently 
of blood vessels with similar dynamics, suggest- 
ing an intrinsic regulation of their migration 
(fig. S4, A to E). The percentage of vOPCs migrating
on blood vessels increases as corticogenesis
proceeds, following the spatiotemporal
progression of cortical angiogenesis (17) (E13.5:
52.46 ± 4.45%; E16.5: 77.52 ± 7.86%; fig. S4, A
and F). Our analysis may underestimate the 
blood vessel coverage by vOPCs because these 
cells extend thin processes between blood ves- 
sels while migrating, which are difficult to image 
by time-lapse recordings. Although interneurons
do not preferentially migrate on blood vessels,
they can be detected in their vicinity (18).

We tested whether the impaired forebrain 
distribution of interneurons upon depletion of 
first-wave vOPCs results from a change in their 
migratory behavior. We crossed OPC-depleted 
mice with Dlx5,6:Cre-GFP mice to generate
GFP+cIN-OPC-depleted embryos and followed
the migration of GFP+ interneurons (Fig. 2A).
Live imaging of organotypic brain slices from 
E13.5 embryos showed a reduction of cortical 
invasion by interneurons in GFP+cIN-OPC-depleted
embryos (Fig. 3, A and B, and movie
S3). In brain slices from E13.5 GFP+cIN-OPC-depleted
embryos, interneurons were migrating
more slowly (Fig. 3C) and closer to isolectinB4-labeled
blood vessels in the cortical wall (Fig.
3D) and in the ventral forebrain (fig. S3, C to
G). At E13.5, the migration parameters of
interneurons in cortical parenchyma devoid of
blood vessels were comparable between
GFP+cIN-OPC-depleted and control cortices (fig. S5,
A to D), suggesting that changes in interneu- 
ron migration between genotypes result from 
a lack of blood vessel coverage by first-wave 
vOPCs. In the forebrain, blood vessels are an 
important source of Cxcl12 (13) (fig. S5, E and
F), which attracts vOPCs that express the C-X-C
chemokine receptor 4 (Cxcr4) upon activation 
of Wnt signaling (13). Although loss of Ng2+
OPCs results in an impaired blood vessel net- 
work before birth (19), the vasculature was not 
affected at E13.5 upon first-wave vOPC deple- 
tion in OPC-depleted embryos (fig. S5, G to S). 
This was further supported by a proteomic 
analysis of the cortical wall that did not in- 
dicate specific molecular changes associated 
with blood vessels (despite detected proteomic 
changes associated with the loss of first-wave 
vOPCs) (fig. S5, T and U). Moreover, we did not 
detect a change of Cxcl12 amount in cortical ex- 
tracts from E13.5 OPC-depleted embryos com- 
pared with their controls (fig. S5V and table S4). 

The ordered migration of interneurons in 
cortical streams relies mostly on gradients of 
chemoattractant molecules, including Cxcl12, 
to which interneurons are responsive through 
their expression of Cxcr4 (18, 20-22). Therefore,
we postulated that a loss of blood vessel
coverage by first-wave vOPCs in E13.5
OPC-depleted embryos would free a local source of
Cxcl12, resulting in attraction and rerouting
of interneurons toward cortical blood vessels.
To test this hypothesis, we incubated organotypic
brain slices from E13.5 GFP+cIN-OPC-depleted
embryos with Cxcl12-blocking antibodies
(Cxcl12 Ab) and measured the distance
traveled by interneurons after contacting a 
blood vessel. We found that interneurons spent 
less time interacting with blood vessels and 
migrated farther away from these structures 
upon initial contact in GFP+cIN-OPC-depleted
brain slices incubated with Cxcl12 Ab (Fig. 3E 
and movie S4). These changes correlated with 
an increased average speed of interneurons 
(Fig. 3F). Incubation of control brain slices 
(GFP+cIN-control) with Cxcl12 Ab did not
modify the distance traveled by interneurons 
from blood vessels (Fig. 3E). The average 
speed was also similar between Cxcl12 treat- 
ment conditions (Fig. 3F). Altogether, these 
results suggest that the coverage of blood 
vessels by first-wave vOPCs prevents the Cxcl12- 
mediated interaction between interneurons 
and blood vessels. To further support this 
hypothesis, we induced first-wave vOPC de- 
tachment from the vasculature by blocking 
Wnt signaling with XAV939 in control brain 
slices (13). This pharmacological induction 
of first-wave vOPC detachment led to an
accumulation of more slowly migrating interneurons
around blood vessels (Fig. 3,
G and H, and movie S5). Blood vessels also
release Cxcl11 (23, 24), which selectively binds
to C-X-C chemokine receptor type 7 (Cxcr7),
a Cxcl12 scavenger that maintains
Cxcr4 activation by preventing its
internalization (25-27). We thus blocked Cxcl11
with antibodies (Cxcl11 Ab) and found that it
rescued interneuron migration in GFP+cIN-OPC-depleted
cortices, likely by preventing
Cxcr4 internalization upon blood vessel-released 
Cxcl12 activation (Fig. 3, I and J). Further- 
more, we found that first-wave vOPCs express 
higher levels of both Cxcr4 and Cxcr7 com- 
pared with interneurons (Fig. 3, K and L), 
which may help them to migrate more effi- 
ciently than interneurons along blood vessels. 
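
The two readouts used in this section, the average migration speed of a tracked cell and its distance from the point of first contact with a blood vessel, can be illustrated with a minimal Python sketch. This is not the authors' analysis pipeline: the function names, the 30-min sampling interval, and the coordinates are hypothetical, and positions are assumed to be (x, y) pairs in micrometers taken from a time-lapse recording.

```python
import math

def average_speed(track, dt_min):
    """Mean speed (um/h) along a track of (x, y) positions in um,
    sampled every dt_min minutes (hypothetical helper)."""
    if len(track) < 2:
        raise ValueError("need at least two positions")
    # Total path length: sum of the step lengths between consecutive frames.
    path = sum(math.dist(a, b) for a, b in zip(track, track[1:]))
    hours = (len(track) - 1) * dt_min / 60.0
    return path / hours

def distance_from_contact(track, contact_index=0):
    """Straight-line distance (um) of each later position from the position
    occupied at first contact with the vessel (hypothetical helper)."""
    origin = track[contact_index]
    return [math.dist(origin, p) for p in track[contact_index:]]

# Made-up track: four positions recorded 30 min apart; frame 0 is the contact.
track = [(0.0, 0.0), (3.0, 4.0), (6.0, 8.0), (6.0, 8.0)]
speed = average_speed(track, dt_min=30)
distances = distance_from_contact(track)
```

In this sketch, a cell that stalls near a vessel yields a flat distance curve and a lower average speed, which is the pattern reported above for interneurons in OPC-depleted slices.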


vOPCs modulate interneuron migration by 
unidirectional contact repulsion 


We found that first-wave vOPC depletion im- 
pairs the distribution and migration of con- 
comitantly born interneurons in forebrain 
parenchyma, including along its blood vessels, 
suggesting the existence of cross-talk between
first-wave vOPCs and interneurons. Culturing
medial ganglionic eminence explants
from E13.5 Nkx2.1:Cre;R26CAGTdTomato embryos
[containing TdTomato+ (Tom+) interneurons]
with dissociated preoptic area isolated from
E13.5 Sox10:GFP-DTA embryos (enriched in
GFP+ first-wave vOPCs) in compartmented
dishes did not change the migration of Tom+
interneurons around their medial ganglionic
eminence core explants (Fig. 4, A to H). This 
suggests that factors released by first-wave 
vOPCs in culture do not influence interneuron 
migration. We next investigated whether con- 
tact between interneurons and first-wave vOPCs 
would influence their respective migration be- 
havior. To test this, we cocultured medial 
ganglionic eminence and preoptic area ex- 
plants (with or without first-wave vOPCs), 
which include isolectinB4-labeled cells (Fig.
4I and fig. S5W). After 48 hours of culture,
we observed a reduction of preoptic area explant
invasion by Tom+ interneurons when
the explants included first-wave vOPCs (Fig. 4, J to L).
This suggested that first-wave vOPCs prevent 
preoptic area invasion by interneurons. At the 
cellular level, live imaging showed that first- 
wave vOPC-interneuron contact is a multistep 
event during which the first-wave vOPC extends
one branch that recognizes the interneuron leading
process on its growth cone-like structure,
followed by polarity reversal and migration of the
interneuron, without altering first-wave vOPC
migration (Fig. 4M and movie S6). This unidirectional
contact repulsion (UCoRe) ended on
average 78.6 ± 6.11 min after the initial contact
(Fig. 4N) and was highly specific, only occur- 
ring between first-wave vOPCs and interneu- 
rons (Fig. 4, O to R). Unidirectional contact 
Fig. 3. First-wave vOPC depletion leads to changes in interneuron migration
behavior. (A and B) Images showing the distribution of interneurons (green) around
blood vessels (purple) on organotypic brain slices from E13.5 GFP+cIN-control (A)
and GFP+cIN-OPC-depleted embryos (B). Right panels are magnified images of
corresponding left panels. White arrowheads in (B) indicate interneurons located in
the vicinity of blood vessels. (C) Distribution of average migration speed of
interneurons moving along cortical blood vessels in E13.5 GFP+cIN-control and
GFP+cIN-OPC-depleted brain slices (n = 25 to 28 cells; ****P < 0.0001, unpaired
t test). (D) Histogram of the distance traveled by interneurons, starting from their
closest position to blood vessels, in brain slices from E13.5 GFP+cIN-control or
GFP+cIN-OPC-depleted embryos. Time 0 corresponds to initial contact of
interneuron with a blood vessel (n = 23 cells; **P = 0.0048 after 30 min, ****P <
0.0001 after 60 to 180 min, two-way ANOVA). (E) Histogram of the distance traveled
by interneurons, starting from their closest position to blood vessels, in brain
slices from E13.5 GFP+cIN-control or from GFP+cIN-OPC-depleted embryos (n = 24
to 25 cells; ***P = 0.0006 after 90 min, ****P < 0.0001 after 120 to 180 min,
two-way ANOVA). (F) Distribution of average migration speed of interneurons moving
along cortical blood vessels on Cxcl12 Ab-treated or vehicle-treated organotypic
brain slices from E13.5 GFP+cIN-control and GFP+cIN-OPC-depleted embryos (n = 25
to 26 cells; ****P < 0.0001, one-way ANOVA). (G) Histogram of the displacement
of interneurons, starting from their closest position to blood vessels, on brain slices
from E13.5 GFP+cIN-control vehicle-treated and GFP+cIN-control XAV939-treated
embryos (n = 23 to 28 cells; ****P < 0.0001 after 30 to 180 min, two-way ANOVA).
(H) Distribution of average migration speed of interneurons moving along cortical
blood vessels on organotypic brain slices from E13.5 GFP+cIN-control vehicle-treated
or GFP+cIN-control XAV939-treated embryos (n = 24 to 28 cells; ****P < 0.0001,
unpaired t test). (I) Histogram of the distance traveled by interneurons, starting from
their closest position to blood vessels, in brain slices from E13.5 GFP+cIN-control
or GFP+cIN-OPC-depleted embryos (n = 30 cells; **P = 0.0069 after 60 min, ****P <
0.0001 after 90 to 180 min, two-way ANOVA). (J) Distribution of average migration
speed of interneurons moving along cortical blood vessels on Cxcl11 Ab-treated or
vehicle-treated organotypic brain slices from E13.5 GFP+cIN-control or
GFP+cIN-OPC-depleted embryos (n = 30 cells; ****P < 0.0001, one-way ANOVA). (K and L) Dot
plots of relative mRNA expression levels of Cxcr7 and Cxcr4 in first-wave vOPCs,
compared with interneurons (n = 3 samples; *P = 0.014 and **P = 0.0029,
respectively, unpaired t test). Scale bars: (A) and (B), 50 μm.
Fig. 4. Analysis of first-wave vOPC-interneuron interactions. (A and B) Experimental
culture setup to test whether first-wave vOPC secretome influences
interneuron migration. (C and D) Images showing cocultures of interneurons with or
without first-wave vOPCs, as depicted in (A) and (B). (E to H) Phase histograms
showing the cell percentage [(E) and (F)] or the distance traveled by interneurons
[(G) and (H)] from the explant core in the culture as depicted in (A) and (B).
(I) Experimental in vitro setup to assess the ability of interneurons (red) to invade
the preoptic area that contains or does not contain first-wave vOPCs (green). (J and
K) Representative images of medial ganglionic eminence-derived explants isolated
from organotypic brain slices of E13.5 Nkx2.1:Cre;R26CAGTdTomato embryos cocultured
with preoptic area-derived explants obtained from organotypic brain slices of
E13.5 Sox10:GFP-DTA (J) or OPC-depleted (K) embryos. (L) Absolute numbers of
repulsion is a distinct mode of cell repulsion
compared with the mutual ones occurring 
between pairs of first-wave vOPCs (39.0% of 
the recorded contacts) or interneurons (8.7% 
of the recorded contacts) that never exper- 
ienced polarity reversal (Fig. 4, Q and R). This 
mode of cell repulsion likely occurs in vivo as it 
guides interneuron migration in cortical regions 
from organotypic slices in the vicinity of or at a 
distance from blood vessels through dynamic 
branching of first-wave vOPCs (Fig. 4P and 
movies S7 and S8). 

We next performed coculture of E16.5 lat- 
eral ganglionic eminence explants containing 
second-wave vOPCs (Sox10:venus/GFP; green) 
with homochronic medial ganglionic eminence 
explants (Nkx2.1:Cre;R26CAGTdTomato; red) containing
interneurons or homochronic caudal
ganglionic eminence explants (Dlx5,6:Cre-GFP;
R26CAGTdTomato; yellow) containing interneurons.
In both cultures, second-wave vOPCs
did not promote unidirectional contact repul- 
sion to most medial ganglionic eminence- or 
caudal ganglionic eminence-derived interneu- 
rons (fig. S6, A to D). 


Unidirectional contact repulsion relies on 
atypical semaphorin-plexin signaling 


To identify the molecular mechanism under- 
lying unidirectional contact repulsion, we per- 
formed enrichment analysis of a gene set 
obtained by RNA sequencing of GFP+ interneurons
purified by fluorescence-activated cell
sorting (FACS) from E13.5 Dlx5,6:Cre-GFP embryos
(28). We ran gene ontology (GO) analysis
(29) to identify the GO term related to chemotaxis 
and refined the analysis by selecting genes 
coding for transmembrane receptors and 
membrane-bound signaling molecules (table 
S1). Among the 10 genes showing the highest 
enrichment in interneurons, we found several 
members of the Eph-ephrin and Plexin (Plxn)- 
Semaphorin (Sema) signaling families, two 
families of receptor ligands involved in contact-repulsion
mechanisms (30, 31) (fig. S7, A and
B) that are also expressed in vOPCs (7). We 
next tested whether these signaling pathways 
underlie unidirectional contact repulsion in 
cocultures of preoptic area and medial ganglionic
eminence explants containing GFP+
first-wave vOPCs and Tom+ interneurons,
respectively. We showed that the pharmacological
blockade of type A and B Eph receptor
families with 200 μM of the competitive and
reversible ligand lithocholic acid (32) did not
affect unidirectional contact repulsion (Fig.
4O and fig. S7C). Therefore, we investigated
whether Plxn-Sema signaling underlies uni- 
directional contact repulsion by knocking down 
simultaneously several Plxn members (Plxna1,
a2, a3, b1, and b2) with expression of a small
interfering RNA (siRNA) pool in interneurons. 
Unidirectional contact repulsion was largely 
impaired upon expression of the Plxn siRNA 
pool (90.90% of the first-wave vOPC-interneuron
contacts) compared with its control (fig. S7, D 
and E). While testing the involvement of each 
individual Plxn in unidirectional contact re- 
pulsion, we found that only targeting Plxna3 
in interneurons abolished unidirectional con- 
tact repulsion (15.40%; Fig. 5, A to D, and fig. 
S7, F to K). We observed that the knockdown
of Plxnb2 induced growth cone retraction with- 
out unidirectional contact repulsion (fig. S7L). 
Analysis of bulk RNA sequencing from first- 
wave vOPCs (7) revealed expression of several 
Semas (table S2). To identify the Sema(s) ex- 
pressed by first-wave vOPCs that would part- 
ner with Plxna3 in unidirectional contact 
repulsion, we excluded secreted Semas and fo- 
cused our analysis on transmembrane class 5 
and 6 Semas, which are known receptors for 
a-class Plxns (33). The analysis of GFP* first- 
wave vOPCs FACS-isolated from E13.5 Sox10: 
GFP-DTA embryos showed that messengers of 
all class 5 and 6 Semas are expressed by first- 
wave vOPCs (fig. S8A). We next tried to knock 
down these Semas in first-wave vOPCs by 
siRNAs or infection with short hairpin lenti- 
viruses. Unfortunately, we were not able to 
transduce first-wave vOPCs. As an alternative, 
we used Neuro2A (N2A) cells, in which the 
endogenous expression of class 5 and 6 Semas 
is low (fig. S8, B to G). Accordingly, we de- 
tected unidirectional contact repulsion in 
a small fraction of E13.5 Tom* interneurons 
that were cocultured with N2A cells (Fig. 5, E 
and H). The frequency of unidirectional contact
repulsion was increased after overexpression
of Sema6a and Sema6b (Fig. 5, F, G, I, and
J), but not of other class 5 and 6 Semas, in N2A 
cells (fig. S8, H to K). By knocking down 
Plxna3 in interneurons, we demonstrated that 
the increase in unidirectional contact-repulsion 
frequency triggered upon Sema6a or Sema6b 
overexpression in N2A cells required endoge- 
nous expression of Plxna3 by interneurons 
(Fig. 5, K and L). This further suggests that a 
transient interaction between interneuron 
Plxna3 and first-wave vOPC Sema6a/6b drives
unidirectional contact repulsion. To further 
test this hypothesis, we used blocking anti- 
bodies that recognize both Sema6a and Sema6b 
(Sema6a/6b; fig. S8, L to N) in coculture of 
interneurons and first-wave vOPCs, which
dose-dependently abolished unidirectional contact
repulsion (Fig. 5, M and N, and fig. S8O). At
the subcellular level, we detected some enrich- 
ment of Plxna3 and Sema6a/6b signal in growth 
cones of interneurons and first-wave vOPCs, 
respectively (Fig. 5, O and P). Such subcellular
location suggests that these molecules are im- 
portant players to drive growth cone collapse 
and lead process retraction during unidirec- 
tional contact repulsion. We further investi- 
gated polarity reversal in interneurons by 
electroporating centrin-GFP and memb-Cherry. 
We observed a sequence during which the 
growth cone of the leading process initiates
retraction just before the regrowth of the new 
leading process at the opposite side, whereas 
the centrosome (centrin-GFP) initiates its migra- 
tion later. Because the centrosome acts as 
a microtubule-organizing center (MTOC), our
results suggest that microtubules are not ini- 
tially involved in the growth of the new lead- 
ing process after interacting with first-wave 
vOPCs or N2A cells overexpressing Sema6b 
(Fig. 5, Q and R, and movies S9 and S10). Al- 
together, our data demonstrate that unidi- 
rectional contact repulsion is driven by an 
interaction between Plxna3 and Sema6a/6b, 
further suggesting that first-wave vOPCs use a 
singular mechanism to signal to interneurons 
during development. 


Discussion 


Here, we provide a characterization of the mi- 
gration pattern of first-wave vOPCs in the de- 
veloping mouse brain. Despite being born in 
shared subpallial germinal regions, vOPCs and
interneurons use distinct migration modes 
and occupy mutually exclusive forebrain terri- 
tories while moving toward the cortex. Moreover, 
these cell types undergo saltatory migration 
characterized by different kinetics, and al- 
though vOPCs preferentially use blood vessels 
to move in the cortex, interneurons navigate 
in organized tangential streams. Among the 
important diffusible cues that guide neural 
migration in the cortex, the chemokine Cxcl12 
attracts both interneurons and vOPCs. The 
Cxcl12 released by intermediate progenitors in 
the cortex marks a migratory stream that 
steers interneurons within the IZ or SVZ 
(21, 34). This chemokine is also secreted by 
forebrain blood vessels and attracts vOPCs
(13). Whereas genetic ablation of interneurons 
does not affect the cortical distribution of 
vOPCs, loss of first-wave vOPCs impairs inter- 
neuron migration in the cortical wall, further 
suggesting that interneurons require first- 
wave vOPCs to properly navigate within the 
cerebral cortex. We showed that first-wave 
vOPC ablation increases the local availability 
of Cxcl12 in the vicinity of blood vessels to 
which interneurons are then attracted, thereby 
altering their progression toward the cortex. 
Because both cell types are responsive to Cxcl12, 
we postulated that an active mechanism carried
out by first-wave vOPCs prevents interneurons
from migrating along blood vessels.
Our live-imaging analysis indeed demonstrated 
that first- but not second-wave vOPCs perform 
unidirectional contact repulsion on interneu- 
rons, thus guiding them to prevent their inter- 
action with blood vessels while contributing 
to their overall progression into the forebrain. 
This mode of contact repulsion with polarity re- 
versal is specific to first-wave vOPC-interneuron 
pairs and distinct from self-repulsion, as pre- 
viously described for OPCs themselves (35) and 
first-wave vOPCs (green) while being incubated with vehicle (M) or 10 ng/ml of
anti-Sema6b antibody (N) (n = 10 to 17 cells; ****P < 0.0001, Fisher's exact
test). (O) Immunolabeling of an interneuron (SiR-Actin, red) and Plxna3
(white). Nuclei were counterstained with DAPI (blue). (P) Immunolabeling of
a first-wave vOPC (GFP+, green) and Sema6a/6b (white). Nuclei were
counterstained with DAPI (blue). Merges are the cor


Cajal-Retzius cells (36, 37). Unidirectional 
contact repulsion on interneurons involves a 
sequence of events that starts with the growth 
cone collapse of their leading process and its 
retraction, followed by the regrowth of a new 
leading process at the opposite side of the cell 
body. We observed that migration of the centrosome
(which acts as an MTOC) from the base of the
disassembling leading process to the base of 
the new one occurs after growth initiation of 
the new process. This suggests that growth ini- 
tiation of the leading process is independent 
of nucleation and polymerization of microtu- 
bules from the MTOC. At the molecular level, 
we identified that an atypical interaction be- 
tween Plxna3 and Sema6a/6b expressed by 
interneurons and first-wave vOPCs, respec- 
tively, triggers a signaling pathway promoting 
unidirectional contact repulsion. Our attempt 
to detect a physical interaction between these 
molecules by coimmunoprecipitation on ex- 
tracts from cocultured N2A cells transfected 
with either Plxna3 or Sema6b was unsuccess- 
ful, likely because such interaction is short 
(<10 min, as reported in Fig. 5R). Moreover, 
the nature of the intracellular cascade acti- 
vated in interneurons upon first-wave vOPC 
interaction remains to be clarified. The reduc- 
tion of interneuron number in the cortex upon 
first-wave vOPC depletion is still observed just 
before birth, an age during which second-wave 
vOPCs are actively migrating. This suggests 
that second-wave vOPCs cannot rescue the 
migration of interneurons, which is further 
supported by the demonstration that second- 
wave vOPCs do not perform unidirectional 
contact repulsion to most homochronic inter- 
neurons. We observed that interneuron deple- 
tion does not have a significant impact on the 
generation, survival, and migration of first- 
and second-wave vOPCs. However, by releasing 
factors such as Sonic hedgehog and fractalkine,
interneurons control the generation of dor- 
sally born OPCs (16, 38). This is another ex- 
ample suggesting that interneuron-OPC 
cross-talk varies spatially and temporally. 
Some first-wave vOPCs and interneurons establish
specific interactions to form transient
synapses at early postnatal stages and later 
engage in myelination of specific local circuits 
(5, 11). However, most first-wave vOPCs die 
before postnatal day 10 (17). Therefore, by show- 
ing that first-wave vOPCs support the migra- 
tion of interneurons in the developing cerebral 
cortex, our work identifies a noncanonical and 
transient role for first-wave vOPCs. Furthermore,
whereas unidirectional contact repulsion
occurs during development between pairs
of interneurons and first-wave vOPCs, some 
pairs must express mutual recognition molec- 
ular codes after birth to promote longer-term 
interaction for the formation of transient syn- 
apses and later axon myelination. 

Our results show that despite being less 
numerous, first-wave vOPCs support the mi- 
gration of interneurons across cortical territo- 
ries by using unidirectional contact repulsion. 
This mechanism allows both cell populations 
to adopt specific migration patterns in the 
cortex while also being attracted by Cxcl12.
Blood vessels are found throughout the corti- 
cal wall and are indeed an important source of 
Cxcl12 that, if not covered and consumed by 
first-wave vOPCs, may affect interneuron mi- 
gration by interfering with the existing cortical 
IZ and MZ Cxcl12 gradients generated by intermediate
progenitors and meningeal cells,
respectively (27, 39). Unidirectional contact 
repulsion contributes to the guidance of in- 
terneurons as the cortex matures and gets 
more perfused by blood vessels, which are a 
source of Cxcl12 release. 


Methods summary 


Genetic mouse models were used to visualize 
or deplete either vOPCs or interneurons. First-wave
vOPC progenies were analyzed at E13.5
in Nkx2.1:Cre;R26CAGTdTomato and Dbx1:Cre;
R26CAGTdTomato mouse models. Sox10:GFP-DTA
and Dlx5,6:Cre-GFP E13.5 and E16.5 brains
were processed into organotypic slices to track
the migration of vOPCs and cortical interneurons,
respectively. Brains from Sox10:GFP-DTA,
Sox10:Venus, Nkx2.1:Cre;R26CAGTdTomato, and
Dlx5,6:Cre-GFP;R26CAGTdTomato mice were
microdissected into explants and cultured to
analyze unidirectional contact repulsion between
vOPCs and interneurons. Sox10:GFP-DTA
females were crossed with either Pdgfra:CreERT
or Dlx5,6:Cre-GFP;Pdgfra:CreERT males
to deplete first-wave vOPCs after intraperi- 
toneal injections of tamoxifen at E11.5 and 
E12.5. A model of interneuron depletion was 
generated by breeding R26:DTA females with 
D1x5,6:Cre-GFP males. 

For all histological analyses, embryonic heads 
or brains were fixed in 4% paraformaldehyde 
overnight at 4°C and then cryopreserved in a 
20% sucrose solution diluted in phosphate- 
buffered saline (PBS) overnight at 4°C. Brains 
were then frozen in optimal cutting temper- 
ature medium and kept at -80°C until cryostat 
cutting. Brain slices were permeabilized, blocked 
in PBS/0.3% Triton X-100/10% donkey serum, 
regions. (Q) Scheme of UCoRe sequence when an interneuron contacts a first-
wave vOPC or a N2A cell overexpressing Sema6a or Sema6b. Growth cone
retraction is in blue, centrosome (or MTOC) in green, and the extending new
leading process in red. (R) Graph showing the order and timing of UCoRe
sequence. Scale bars: (O) and (P), 10 μm; (A), (B), (E), (F), and (G), 50 μm.
KD, knockdown; OE, overexpression. 


and then incubated overnight at 4°C with pri- 
mary antibodies diluted in PBS/0.3% Triton 
X-100/1% donkey serum. After three rinses 
with PBS/0.3% Triton X-100/5% donkey serum, 
slices were incubated in the same solution 
containing secondary antibodies for 2 hours at 
room temperature. Images were obtained using 
an inverted confocal microscope (Nikon A1). 

Organotypic slice cultures from embryonic 
brains were used to study the migration of 
vOPCs and cortical interneurons. Brains were 
processed into 300-μm coronal slices and harvested
on a Millicell (Millipore) culture insert
placed on glass-bottomed dishes. Dishes were 
filled with neurobasal medium supplemented 
in B27, N2, and glutamine. Slices were kept in 
a humidified incubator saturated with 5% CO2
for a maximum of 8 hours before time-lapse 
recording. IsolectinB4 was added on top of 
organotypic slices to stain blood vessels. For 
every treatment (Cxcl11 Ab, Cxcl12 Ab, or
XAV939), molecules were added on top of 
organotypic slices as well. Time-lapse acqui- 
sitions were obtained every 5 min for at least 
5 hours using an inverted confocal micro- 
scope. Analyses of migration parameters were 
performed using the mTrackJ plugin in Fiji. 

Study of vOPC-interneuron interaction was 
performed in a two-dimensional culture of 
primary vOPCs and interneurons. These cul- 
tures were generated by placing microdissected 
ganglionic eminence and preoptic area ex- 
plants on top of a homochronic cortical feeder. 
Briefly, glass-bottomed dishes were coated with 
poly-ornithine and laminin. The feeder layer 
of cortical cells was prepared by mechanical 
dissociation of either E13.5 or E16.5 wild-type
cortices in supplemented neurobasal medium. 
The dissociated cells were plated in coated 
glass-bottomed dishes and left to adhere for 
2 hours at 37°C in a humidified incubator 
saturated with 5% CO2. The next day, ganglionic
eminence or preoptic area explants of
~300 μm were then plated on top of the cortical
feeder layer. To knock down Plexins in
interneurons at E13.5, ex vivo electroporation 
of medial ganglionic eminences was performed 
before microdissecting the explants. For analy- 
ses of contact repulsion between vOPCs and 
interneurons, time-lapse acquisitions were ob- 
tained every 5 min for at least 10 hours using 
an inverted confocal microscope. 
CELL BIOLOGY


The deubiquitinase USP8 targets ESCRT-III 
to promote incomplete cell division 


Juliette Mathieu*, Pascale Michel-Hissier, Virginie Boucherit, Jean-René Huynh* 


In many vertebrate and invertebrate organisms, gametes develop within groups of interconnected cells called 
germline cysts formed by several rounds of incomplete divisions. We found that loss of the deubiquitinase 
USP8 gene in Drosophila can transform incomplete divisions of germline cells into complete divisions. 
Conversely, overexpression of USP8 in germline stem cells is sufficient for the reverse transformation from 
complete to incomplete cytokinesis. The ESCRT-III proteins CHMP2B and Shrub/CHMP4 are targets of 
USP8 deubiquitinating activity. In Usp8 mutant sister cells, ectopic recruitment of ESCRT proteins at 
intercellular bridges causes cysts to break apart. A Shrub/CHMP4 variant that cannot be ubiquitinated 
does not localize at abscission bridges and cannot complete abscission. Our results uncover ubiquitination 
of ESCRT-III as a major switch between two types of cell division. 


Clonal multicellularity arises by incomplete
divisions of single-cell precursors
and gives rise to groups of sister cells
linked by cytoplasmic bridges. Such
mechanisms underlie the formation of
colonies of unicellular organisms and the em- 
ergence of multicellularity (1). It is also a con- 
served feature of germline cell development, 
where cysts of sister cells share the same cyto- 
plasm (2). During Drosophila oogenesis, cells 
display complete or incomplete cytokinesis. 
Indeed, germline stem cells (GSCs) first divide 
completely to produce a single-cell precursor 
called a cystoblast (CB). Then, each CB goes 
through four rounds of incomplete division to 
generate germline cysts made of 16 sister cells, 
all linked by cytoplasmic bridges called ring 
canals (RCs) (3) (Fig. 1, A and B). 
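
The cell counts quoted in this article follow from simple doubling, which can be sketched as below. This is an illustrative check only: it assumes every cell divides synchronously at each round and that each incomplete division leaves exactly one bridge; the function name is hypothetical and not part of the study.

```python
def cyst_after_rounds(rounds: int) -> tuple[int, int]:
    """Return (cells, ring_canals) after synchronous incomplete divisions.

    Assumption for illustration: each dividing cell leaves one
    stabilized cytoplasmic bridge (ring canal) behind.
    """
    cells = 1
    ring_canals = 0
    for _ in range(rounds):
        ring_canals += cells  # one new bridge per dividing cell
        cells *= 2            # every cell divides once per round
    return cells, ring_canals


# Four rounds starting from one cystoblast give a 16-cell cyst.
print(cyst_after_rounds(4))
```

Under the same assumptions, one extra round of incomplete division gives 32 cells, and fewer rounds give 8, 4, or 2 cells.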


USP8 is necessary and sufficient to promote 
incomplete divisions 


To identify genes involved in cyst formation, we 
expressed a collection of short hairpin RNAs 
(shRNAs) and screened for egg chambers with 
an abnormal number of nuclei. Two inde- 
pendent shRNA lines targeting the deubiquiti- 
nating enzyme USP8 (4) induced the formation 
of egg chambers made of eight, four, or two 
nuclei, instead of 16 (Fig. 1, C and E, and fig. S1, 
A and B). These egg chambers resulted from 
partial breaking of cysts due to ectopic cyto- 
kinetic abscission in the germarium. Indeed, 
the fusome, an endoplasmic reticulum (ER)-derived
organelle, formed a midbody-like
structure, a feature of complete abscission, 
which is not seen in wild-type (WT) cysts (Fig. 
1, D and F, and fig. S1, B and C) (5, 6). We 
demonstrated that they were complete abscissions
by showing that photoactivatable molecules
were not able to diffuse between sister
cells, when the midbody structure was present 
(fig. S2, A and B, and movies S1 to S4). Fur- 
thermore, using live imaging, we observed the 
physical separation of the fusome in mutant 
cysts (fig. S2, C and D, and movies S5 and S6).
Depletion of Usp8 caused similar ectopic ab- 
scissions in male germ cells (fig. S1, D and E). 
Homozygous mutant cysts for a null allele of 
Usp8 (7) [Usp8KO germline clone (GLC)] exhibited
the same complete abscission phenotype,
arguing for the specificity of these shRNAs
(fig. S1, F to H). To rule out the possibility that 
complete abscissions of Usp8-depleted cysts 
were caused by dedifferentiation of cysts into 
GSCs (8, 9), we verified that Usp8-depleted 
cysts still expressed the differentiation marker 
Bam (fig. S3, A and B). Usp8KO GLC also lacked
Nanos expression at four-cell cysts (10), similarly
to control differentiating cysts (fig. S3C).
In addition, we found that mutant cysts could 
enter meiosis, although an oocyte was never 
maintained (fig. S3, D and E). Together, these 
observations indicated that Usp8 is necessary to 
maintain incomplete cytokinesis in Drosophila 
germline cysts. 

Conversely, we tested whether USP8 could 
be sufficient to inhibit cytokinesis in cells that 
normally complete abscission, such as GSCs. 
We overexpressed a wild-type form of USP8 
in these stem cells and observed the forma- 
tion of stabilized cytoplasmic bridges in stem 
cells (“stem cysts”) (11) (Fig. 1H), which led to 
CBs made of two cells and egg chambers made 
of 32 rather than 16 cells (Fig. 1G and fig. S4A) 
(6, 11, 12). Incomplete cytokinesis in GSCs 
overexpressing USP8 was not caused by pre- 
mature differentiation of GSCs, as these GSCs 
did not express the cyst-specific protein Bam 
(Fig. 1H). Similarly, we found that overexpress- 
ing USP8 in somatic follicle cells surrounding 


germline cells and in Schneider 2 (S2) cells 
also led to the formation of ectopic cysts (fig. 
S4, C to I). Taken together, these results de- 
monstrated that USP8 is necessary and suf- 
ficient to promote incomplete divisions in 
germline and somatic cells. 


USP8 deubiquitinates ESCRT-III proteins 


Next, we tested whether USP8 deubiquitinase 
activity (DUB) was required for the regulation 
of abscission. We expressed a catalytically inac- 
tive form of USP8, called USP8-C572A (7, 13, 14), 
in Usp8KO cysts and found that, in contrast to
wild-type USP8, it was unable to rescue germ- 
line cysts from breaking (Fig. 2, A and B, and 
fig. S5, A to C). Similarly, overexpression of 
USP8-C572A was not sufficient to arrest cyto- 
kinesis or induce the formation of stem cysts 
in GSCs, follicle cells, or S2 cells (fig. S4, A to I). 
Hence, USP8 DUB activity is required for the 
regulation of abscission. 

We observed that USP8 localized at sites of 
cytokinesis in both males and females, that is, 
in midbodies of GSC/CB pairs, RCs of germ- 
line cysts, and follicle cell RCs (Fig. 2, C and 
D, and fig. S6, A to C). In the absence of Usp8, 
we noticed an accumulation of ubiquitinated 
proteins at ectopic midbodies (Fig. 2, E and 
F), in addition to cytoplasmic aggregates, as 
described in mammals (fig. S7, A to F). These 
data suggest that the relevant targets of USP8 
could be localized at abscission sites. 

The N-terminal microtubule interacting and 
transport (MIT) domain of human USP8 in- 
teracts with ESCRT-III proteins, which are 
thought to be the driving force of several mem- 
brane scission events, including abscission 
(15-18). We found that three Drosophila ESCRT- 
III proteins, CHMP1 (charged multivesicular
body protein 1), CHMP2B (charged multi- 
vesicular body protein 2b), and Shrub (the 
unique Drosophila homolog of CHMP4), could 
immunoprecipitate USP8 (Fig. 3A and fig. 
S8A) (19-21). All three proteins were ubiquiti- 
nated in Drosophila cells (Fig. 3B and fig. S8B). 
Expressing a wild-type form of USP8 was 
sufficient to greatly reduce the amount of 
ubiquitinated CHMP2B and Shrub, but not 
of CHMP1 (Fig. 3B and fig. S8B). In contrast,
the inactive USP8-C572A had no effect on the 
ubiquitination levels, although it could still 
bind all three ESCRT-III proteins (Fig. 3, A 
and B, and fig. S8, A and B). Conversely, knocking
down Usp8 in S2 cells led to increased
levels of ubiquitinated Shrub and CHMP2B 
(fig. S8, C to E). We concluded that, unlike 
in intralumenal vesicle formation in yeast 
(22), USP8 is necessary and sufficient to de- 
ubiquitinate CHMP2B and Shrub. 


USP8 regulates ESCRT-III accumulation at 
the intercellular bridge 


We then investigated whether CHMP2B and 
Shrub could be relevant targets in vivo. Shrub, 
Fig. 1. USP8 is necessary and sufficient to
block abscission in the germline lineage.
(A) (Top) Female ovariole. (Bottom) Cell lineage
of the germline stem cell (GSC). (B) Mitotic
region of the germarium. At the end of mitosis,
GSCs are connected to their daughter CBs
through a ring canal (RC) (GSC #1) and later by
a midbody (MB) (GSC #2). After complete
cytokinesis, GSCs and CBs are fully separated
(GSC #3). The fusome (red) links the GSC and
the CB before abscission and also passes
through all RCs of germline cysts. Cap cells,
light gray; terminal filament cells, dark gray.
(C to H') Confocal images of fixed ovarioles [(C),
(E), and (G)] and germaria [(D), (F), and (H)]
of females expressing shRNA-w, shRNA-Usp8,
or Usp8-WT under the control of nos-GAL4
driver. [(D') and (F')] Close-up images of the
cysts framed in (D) and (F). Dashed lines
surround germline cysts [(D') and (F')] or a
stem cyst that originates from the GSC [(H) and
(H')]. The white arrow indicates the ectopic
MB of a breaking cyst. Scale bars: 20 μm [(C),
(E), and (G)] and 10 μm [(D), (F), and (H)].
4CC, four-cell cyst; 8CC, eight-cell cyst.


Fig. 2. The deubiquitinating activity of
USP8 is essential for its function in germ-
line cysts. (A) Scheme of the V5-tagged
variants of USP8 proteins. (B) Fraction of four-
and eight-cell cysts exhibiting a breaking
phenotype. P values (p) are indicated and
derived from chi-square tests. The n values
indicate the number of cysts analyzed. (C and
D) Confocal images of germaria expressing
green fluorescent protein (GFP)-tagged
USP8 from the endogenous locus. (C' and
D') Close-up of the cells framed in (C) and (D).
Dashed lines surround a GSC and its daughter
CB (C') and a germline cyst (D'). The white
arrow indicates the MB between the GSC and
the CB in (C'), and the red arrow indicates the
central RC in (D'). (E and F) Confocal images of
germaria expressing shRNA-w (E) or shRNA-
Usp8 (F) under the control of nos-GAL4 driver
stained with α-ubiquitin (Ubi) and α-spectrin.
(E' and E'') Close-up views of the GSC/CB pair
framed in (E). (F' and F'') Close-up views of
the breaking 4CC framed in (F). White arrows
indicate the MB between the GSC and the CB in
(E), and the MB of the breaking 4CC in (F).
Scale bars: 10 μm.
Fig. 3. USP8 regulates CHMP2B and Shrub
ubiquitination levels and localization.
(A) Immunoblots of input (“In,” 1/20 of the IP)
or immunoprecipitated (“IP”) ESCRT-III
proteins from S2 cells transfected
with ESCRT-HA (hemagglutinin)
and V5-tagged USP8 variants.
Membranes were blotted
with α-V5 antibody. (B) Immunoblots
of input (1/20 of the IP)
or IP ubiquitinated proteins
from S2 cells transfected with
ESCRT-HA, His-tagged ubiquitin,
and V5-tagged USP8 variants.
Membranes were blotted with
α-HA antibody to reveal ubiquitinated
ESCRT-III. USP8 transfection
is sufficient to abolish
CHMP2B and Shrub ubiquitination
(red frame). (C and
D) Confocal images of fixed
germaria of females expressing
Shrub-HA under the control of
nos-Gal4. A GSC/CB pair (C) and
a 4CC (D) are surrounded by
dashed lines. The white arrow in
(C) indicates the MB between
the GSC and the CB. Shrub-HA is
not detected at the 4CC RC.
Scale bars: 5 μm. (E) Confocal
images of a fixed germarium
of a female expressing Shrub-HA
under the control of nos-Gal4
with Usp8KO GLC [red fluorescent
protein (RFP)-negative cells].
The white arrow indicates the
the number of cysts analyzed.
(G) Confocal images of a
fixed ovariole of a female
expressing shRNA-Usp8 in a
shrbG5/+ background. The number of nuclei of some ovarioles is indicated. Scale bar: 20 μm.


CHMP2B, and CHMP1 localized at the RC and 
midbody of GSC/CB pairs (6) but were absent 
at cyst RCs (Fig. 3, C and D, and figs. S9, A, B, 
D, F, G, I, and J, and S10, A and C). However, in 
male and female cysts lacking Usp8, Shrub, 
CHMP2B, and CHMP1 all localized at ectopic 
midbodies and cyst RCs (Fig. 3E and figs. S9, 
C, E, H, and K, and S10, B and D). To test 
whether the ectopic localization of ESCRT-III 
was responsible for ectopic abscissions seen in 
the absence of USP8, we reduced Shrub or 
CHMP2B levels in Usp8-depleted cysts. In 
these contexts, we counted fewer four-cell
cysts in the process of breaking apart than in 
cysts depleted of USP8 alone (Fig. 3F and fig. 
S11, A to C). In addition, we found that older 
egg chambers contained more nuclei than did 
sh-Usp8 mutant egg chambers (fig. S11, D and 
E). We also observed complete rescue with 
egg chambers made of 16 cells and a posterior 
oocyte, which was not seen with sh-Usp8 alone 
(Fig. 3G). Altogether, we conclude that deubiquitination
of Shrub and CHMP2B by USP8 is
required to block their localization at RCs and 
subsequent abscissions, allowing the forma-
tion of 16-cell germline cysts. 


Ubiquitination of Shrub is essential to 
promote abscission in the GSCs 


Next, we assessed the relevance of ESCRT pro- 
tein ubiquitination during complete abscission 
by testing the ability of a nonubiquitinable 
variant of Shrub (Shrub-KR) (fig. S12A) to 
replace endogenous Shrub in GSCs. When we 
removed both copies of Shrub from GSCs, we 
found that Shrub-KR only poorly compensated 
Fig. 4. Ubiquitination of Shrub is essential to promote abscission in the
GSCs. (A to C) Confocal images of fixed germaria with shrbG5 GLC (GFP
negative) expressing Shrb-WT-HA (B), Shrb-KR-HA (C), or no transgene (A).
Dashed lines surround a GSC/CB pair (B) or stem cysts [(A) and (C)]. White


for the absence of Shrub, whereas Shrub-WT 
rescued Shrub mutant germline clones (Fig. 
4, A to D, and fig. S12, A and B). In these 
nonrescued GSCs, Shrub-KR only formed cyto- 
plasmic aggregates and did not localize at RCs 
(Fig. 4C). Shrub-KR and Shrub-WT were equally 
able to fully rescue stem-cyst phenotypes 
when only one copy of Shrub was removed in 
ShrubG5/+ females (fig. S12C). Like Shrub-WT,
Shrub-KR localized at RCs and midbodies of
ShrubG5/+ heterozygous GSC/CB pairs (fig. S12,
D and E). Furthermore, we could precipitate 
Shrub-KR and Shrub-WT in Drosophila cells, 
indicating that both proteins could interact 
physically (fig. S12F). These results showed 
that Shrub-KR was still functional and able 
to localize properly when one copy of endoge- 
nous Shrub remained in GSCs. Finally, knocking 
down Usp8 in ShrubG5/+ heterozygous females
rescued abscission arrest and the formation 
of stem cysts, indicating that ubiquitination 
could also regulate ESCRT-III in GSCs (fig. S12,
G to I). Overall, our results show that in both 
GSCs and germline cysts, USP8 deubiquiti- 
nates ESCRT-III proteins to inhibit their local- 
ization at intercellular bridges. 


A model for the differential accumulation of 
ESCRT-III at intercellular bridges in GSCs and 
germline cysts


Whereas ESCRT proteins are barely detectable 
in germline cysts, they accumulate at GSC midbodies,
despite the expression of USP8 in both
cell types (Figs. 2, C and D, and 3, C and D, and 
fig. S9). This apparent contradiction could be 
explained by a major difference in cell cycle 
length between GSCs and germline cysts. GSCs 
cycle in 24 hours and take ~15 hours to com- 
plete abscission (23, 24). In contrast, cysts go 
through four rounds of mitosis in 24 hours, 
that is, 6 hours for each cycle. In GSCs, we measured
a progressive accumulation of ESCRT-III
proteins at the GSC-CB intercellular bridge
during the 15 hours after mitosis, whereas
ESCRT-III remained barely detectable in cyst
cells (Fig. 5A). In contrast, in the absence of 
Usp8, CHMP2B also accumulated progressive- 
ly in cyst cells with ectopic abscission (Fig. 5A). 
We noted that the levels of USP8 varied along 
the cell cycle, peaking during mitosis (Fig. 5B). 
These observations suggest that in GSCs, ESCRT 
proteins have time to accumulate to complete 
abscission before the next mitosis, whereas in 
cysts, shorter cell cycles and the presence of 
peak levels of USP8 prevent ESCRT accumu- 
lation and stabilize incomplete divisions. In 
this model, the combination of different cell 
cycle lengths and USP8 accumulation during 
mitosis is sufficient to explain the switch from 
complete to incomplete cytokinesis (Fig. 5E). 
To test this model experimentally, we reasoned 
that lengthening the postmitotic phase of germ- 
line cysts could be sufficient for ESCRT pro- 
tein accumulation and completion of cell 




arrow indicates the MB positive for Shrb-WT-HA. Scale bars: 10 μm. (D) Fraction
of GSCs exhibiting a stem-cyst or polyploid phenotype. P values (p) are
derived from chi-square tests. ns, not significant (P = 0.01). The n values
indicate the number of GSCs analyzed. 


divisions. We used flies in which the differen- 
tiation of germline cysts could be synchronized 
by the expression of the bam gene following a 
heat shock (HS) (9, 25). At 18 hours after HS, 
the majority of four-cell cysts in these flies 
were devoid of ESCRT proteins as in wild-type 
cysts (Fig. 5, C and D, left, and fig. S13, A, B, D, 
and E). When we dissected flies 28 hours after 
HS, most four-cell cysts exhibited CHMP2B- 
GFP at the RCs (Fig. 5D, middle, and fig. S13, 
D and E), some of which were even breaking 
apart, as in Usp8 mutant cysts (fig. S13F). At
38 hours after HS, we observed an increase in 
both the amount of CHMP2B-GFP at the RCs 
and the number of four-cell cysts breaking 
apart (Fig. 5D, right, and fig. S13, E and F).
These results show that when germline cysts 
were blocked at the four-cell stage for at least 
10 hours (from 18 to 28 hours) or 20 hours 
(from 18 to 38 hours), they accumulated ESCRT 
proteins at ring canals and completed abscission.
Thus, lengthening
the postmitotic phase of four-cell cysts is suf- 
ficient for ESCRT proteins to accumulate at 
ring canals and induce ectopic abscission in 
germline cysts, as in a Usp8 mutant (Fig. 5E). 
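
The timing argument above can be illustrated with a deliberately simple toy model. Everything quantitative in it is an assumption made for illustration rather than a measurement from the study: ESCRT is taken to accumulate linearly at the bridge, a USP8 peak at each mitosis is taken to reset it to zero, and abscission is taken to occur once the level reached after 15 hours is attained. The function and parameter names are hypothetical.

```python
def hours_to_abscission(cycle_h, usp8_present, total_h=48.0,
                        rate=1.0, threshold=15.0, dt=0.5):
    """Return the time (h) at which ESCRT reaches threshold, or None.

    Toy assumptions: linear accumulation at `rate` per hour, and a
    complete reset at each mitosis when USP8 is present.
    """
    escrt = 0.0
    t = 0.0
    since_mitosis = 0.0
    while t < total_h:
        t += dt
        since_mitosis += dt
        escrt += rate * dt
        if escrt >= threshold:
            return t                  # bridge is cut: complete division
        if since_mitosis >= cycle_h:  # next mitosis begins
            since_mitosis = 0.0
            if usp8_present:
                escrt = 0.0           # USP8 peak clears ESCRT at the bridge
    return None


print("GSC, 24-h cycle, USP8 present:", hours_to_abscission(24, True))
print("cyst, 6-h cycle, USP8 present:", hours_to_abscission(6, True))
print("cyst, 6-h cycle, USP8 absent: ", hours_to_abscission(6, False))
print("cyst arrested, USP8 present:  ", hours_to_abscission(100, True))
```

In this sketch only the short-cycling cyst with USP8 never reaches the threshold, which mirrors the qualitative behavior described in the text; the numbers themselves carry no experimental meaning.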

Collectively, our data demonstrate that 
ubiquitination of ESCRT-III is critical to reg- 
ulate the switch between complete and incom- 
plete divisions. We propose a model in which 
USP8 deubiquitinates the ESCRT-III proteins 
Shrub and CHMP2B, which prevents complete 
Fig. 5. Differential accumulation
of ESCRT-III at intercellular
bridges in GSCs and germline
cysts. (A) Fluorescence intensity
[arbitrary units (A.U.)] of CHMP2B-
GFP at the RC (pre-MB) or the MB of
GSC and at the central RC of cysts
(2CC, 4CC, and 8CC) in control
(green) or sh-Usp8 (red)-expressing
flies. P values (p) between control
and shRNA-expressing flies are
derived from Mann-Whitney tests.
(B) Fluorescence intensity (A.U.) of
USP8-GFP at the RC (pre-MB) or the
MB of GSC and at the central RC
of cysts (2CC, 4CC, 8CC, and 16CC)
in Usp8“' flies in interphase (black)
or mitosis (orange). P values
(p) between interphase and mitosis
are derived from Mann-Whitney
tests. (C and D) Confocal images and
schemes of fusomes (stained for
α-spectrin, red) in germaria of WT
females expressing CHMP2B-GFP
(C) or of bamΔ86 females expressing
CHMP2B-GFP and HS-bam (D). Time
after heat shock is indicated on
the dark arrow between (C) and (D).
In (D), 4CCs are observed at 18 hours
after HS (left). CHMP2B-GFP is
then observed at the RCs (red arrow)
of a 4CC (28 hours, middle), and
the central RC of a 4CC breaks into a
MB (white arrow) at 38 hours after
HS (right). Scale bars: 5 μm.
(E) Schematic model of ESCRT levels
over time in the germarium. The
GSC has a 24-hour cell cycle.
Abscission between the newly formed
GSC and its daughter CB occurs late
in the cell cycle, before the next
mitosis, and thus the GSC/CB pair
never experiences a USP8 peak.
During this long period, the GSC/CB
RC therefore accumulates ESCRT,
reaching a high level allowing abscis-
abscission and allows sister cells to remain
linked by stabilized cytoplasmic bridges. Both 
GSCs and germline cysts express Usp8 and 
several ESCRT-III proteins, despite different 
outcomes for abscission. Complete and incom- 
plete abscission are thus not inherently differ- 
ent in nature. We propose that it is mainly the 
duration of abscission, regulated by the cell 
cycle, that differs in the two cases (Fig. 5E). 

A major question arising from our work is 
the evolutionary relationship between complete 
and incomplete division, and whether USP8 
played a role in the emergence of one from
the other. We observed the most drastic con- 
sequences of Usp8 loss in germline cysts, not 
in stem cells, indicating that the main function 
of USP8 is to promote incomplete division and 
that it seems dispensable for complete cyto- 
kinesis. Besides germ cells, unicellular organ- 
isms such as the choanoflagellates can also 
form colonies of sister cells by incomplete 
divisions. These colonies, considered a first step 
toward multicellularity, are linked by cytoplas- 
mic bridges that resemble metazoan ring canals 
sion. The CB becomes a 16CC in 24 hours; therefore each cycle lasts only 6 hours, leaving much less time to accumulate ESCRT at RC at each cycle. In addition,
USP8 levels at the RC in the cyst peak at each mitosis (orange bars) and remove (or block the dynamic recruitment of) ESCRTs at the RCs. We propose that 

short cell cycle duration combined with USP8 peaks in mitosis are responsible for both the low levels of ESCRT at cyst RCs and the absence of abscission. When USP8 
is absent (red), ESCRT levels increase at each cyst cycle, allowing abscission. 


(2). Given that ESCRT-III function in abscission 
is conserved from archaebacteria to verte- 
brates (26), it will be interesting to investigate 
whether ubiquitination of ESCRT-III plays a 
role in the emergence of these incomplete 
divisions. 
MOLECULAR EVOLUTION


Epistatic drift causes gradual decay of predictability
in protein evolution


Yeonwoo Park, Brian P. H. Metzger, Joseph W. Thornton*


Epistatic interactions can make the outcomes of evolution unpredictable, but no comprehensive data are 
available on the extent and temporal dynamics of changes in the effects of mutations as protein 
sequences evolve. Here, we use phylogenetic deep mutational scanning to measure the functional 
effect of every possible amino acid mutation in a series of ancestral and extant steroid receptor DNA 
binding domains. Across 700 million years of evolution, epistatic interactions caused the effects of most 
mutations to become decorrelated from their initial effects and their windows of evolutionary accessibility to 
open and close transiently. Most effects changed gradually and without bias at rates that were largely 
constant across time, indicating a neutral process caused by many weak epistatic interactions. Our findings 
show that protein sequences drift inexorably into contingency and unpredictability, but that the process
is statistically predictable, given sufficient phylogenetic and experimental data.


A mutation’s evolutionary fate depends
on its phenotypic effects. If the effects
are stable over time, knowledge of them
in the present can help predict the future
course of evolution and explain the
causes of evolutionary change in the past. 
Epistatic interactions, however, may cause a 
mutation’s effects to change over time and its 
evolutionary accessibility to become contin- 
gent on the particular sequence changes that 
preceded it during history (1, 2).

Despite a recent tide of information about 
epistatic interactions within proteins, we lack 
a comprehensive understanding of changes in 
the effects of mutations (the set of all potential 
amino acid changes) caused by interactions 
with substitutions (the subset of mutations
that fix during evolution). What fraction of 
mutations change in their effects over evolu- 
tionary time, and how drastically? Do they 
change gradually or episodically and at what 
rate? What are the consequences for evolu- 
tionary outcomes? Deep mutational scanning 
(DMS) experiments have detected epistasis 
among mutations within present-day proteins 
(3-9), but these studies do not address inter- 
actions with historical substitutions or reveal 
changes in the effects of mutations over evolu- 
tionary time. Some mutations have different 
effects when introduced into various present- 
day proteins, implying epistatic interactions 
with the substitutions that occurred as these 
proteins diverged from each other (10-14); 
without polarizing and calibrating these dif- 
ferences with respect to time, however, it is not 
possible to illuminate the rate, direction, or 
regularity of the process by which the effects of 
mutations changed during evolution. Ances- 
tral protein reconstruction studies have shown 


that the effects of particular mutations changed 
during particular phylogenetic intervals (15-22),
but these works have examined only the begin- 
ning and end of an interval and therefore cannot 
reveal the temporal dynamics of epistasis. 
Here, we address this knowledge gap by 
using DMS to comprehensively assess the 
effect of introducing every possible amino acid 
mutation into a series of reconstructed ances- 
tral and extant proteins along a densely sam- 
pled phylogenetic trajectory. We used as a 
model the DNA binding domain (DBD) of 
steroid hormone receptors, a family of essen- 
tial transcription factors in bilaterian animals 
that mediate the actions of sex and adrenal 
steroids by binding to specific DNA sequences 
and regulating the expression of target genes 
(23-25). This approach allowed us to measure 
changes in the functional effect of every pos- 
sible amino acid mutation during a series of 
defined intervals across 700 million years of 
DBD evolution. To analyze these data, we de- 
veloped a quantitative framework that treats 
each mutation’s effect as a trait that evolves 
probabilistically on a phylogeny, which we used 
to characterize the temporal dynamics, evolu- 
tionary consequences, and underlying genetic 
architecture of epistatic interactions. 


Results 
Phylogenetic DMS 


We first inferred the phylogeny of steroid and 
related receptors (Fig. 1A and fig. S1) and re- 
constructed the maximum a posteriori pro- 
tein sequences of seven ancestral DBDs along 
two lineages—one leading to the human glu- 
cocorticoid receptor (GR) and the other to the 
steroid receptor of the annelid Capitella teleta, 
which are among the most diverged of all 
functionally characterized extant DBDs. The 
reconstructed ancestral proteins include the
ancient progenitor protein whose duplication 
and divergence gave rise to the first steroid 
receptor (AncNR3), the ancestor of all extant
steroid receptors (AncSR, which existed in the 
ancestor of all bilaterians), and five descendants
of AncSR (Fig. 1B and fig. S2). These
nine DBDs are separated by eight phylogenetic 
intervals, each comprising 3 to 42% sequence 
divergence. We constructed a yeast strain that 
carries a green fluorescent protein (GFP) re- 
porter driven by a DNA response element for 
these DBDs and confirmed that all recon- 
structed ancestral DBDs bind to it, as expected 
based on prior studies (25). GFP fluorescence 
in this strain correlates well with binding af- 
finity, which was previously measured using 
fluorescence anisotropy (fig. S4D). 

For each of the seven ancestral and two 
extant DBDs, we generated a library of variants 
that contains all 19 possible amino acid muta- 
tions at all 76 sites (fig. S3). We used a bulk assay 
of fluorescence-activated cell sorting coupled 
with deep sequencing to quantify the GFP 
fluorescence of each variant with very high 
repeatability [Pearson’s correlation coefficient 
squared (r²) = 0.99 across three replicates;
Fig. 1C and figs. S4 and S5]. We calculated
the effect of a mutation as the difference in the
mean log10-GFP fluorescence (ΔF) between
variants that differ by a single amino acid; we 
applied this approach to all mutations from 
the wild-type amino acid in any of the nine 
DBDs to all other 19 amino acids. 
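The definition of a mutation's effect given above can be sketched in a few lines. This is only an illustration under stated assumptions: the function names and the replicate values are hypothetical, and the sketch averages replicate log10-GFP values before taking the difference, which may not match the authors' exact pipeline.

```python
def mean_log10_fluorescence(replicates):
    """Average log10-GFP fluorescence over replicate measurements."""
    return sum(replicates) / len(replicates)


def mutation_effect(f_mutant, f_wild_type):
    """Delta-F: mean log10 fluorescence of the mutant minus that of the wild type."""
    return mean_log10_fluorescence(f_mutant) - mean_log10_fluorescence(f_wild_type)


# Hypothetical log10-GFP values for three replicates (not data from the study).
wild_type = [4.00, 4.02, 3.98]
mutant = [3.40, 3.42, 3.38]
delta_f = mutation_effect(mutant, wild_type)
```

A negative value indicates that the mutation reduces reporter activity relative to the wild-type DBD in which it was measured.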


[Fig. 1A tree labels: C. teleta SR; protostome SRs; hemichordate SRs; chordate ERs; chordate kSRs; human GR; placozoan receptors. Scale bar, 0.5 subs/site.] 


Fig. 1. Phylogenetic DMS. (A) Phylogeny of the DBD of steroid and related 
receptors. Circles represent DBDs characterized here by DMS. The complete 
phylogeny is shown in fig. S1. ERs, estrogen receptors; kSRs, ketosteroid 
receptors, including GR; SRs, steroid receptors; subs/site, substitutions per site. 
(B) Phylogenetic relations among the nine characterized DBDs. Colors distinguish 
trajectories to C. teleta SR and human GR. Sequence divergence (percent) 

and number of sequence differences (in parentheses) in each interval are shown. 
(C) Sort-seq assay for DBD activity. For each DBD, a library containing all 
possible single-amino-acid mutations was generated using microarray-based 
synthesis and cassette assembly (fig. S3) and cloned into yeast carrying a GFP 
reporter (left and middle). The activity of each mutant was measured by sorting 
Differences in the effect of a mutation 
between successive nodes on the phylogeny 
(ΔΔF) indicate that the mutation interacts 
with historical substitutions that occurred 
during that interval (Fig. 1D). We normalized 
the effects of the mutations to remove global 
background-dependence caused by different 
wild-type activity levels (fig. S6). After this 
correction, differences in a mutation’s effect 
among the nine DBDs are attributable to 
specific epistatic interactions with intervening 
substitutions on the phylogeny. 
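As a minimal sketch of the quantity described above, the epistatic change in each interval can be computed as the difference in a mutation's effect between successive nodes. The sign convention (descendant minus ancestor), the function name, and the example values are assumptions made for illustration; the normalization for wild-type activity (fig. S6) is not reproduced here.

```python
def epistatic_changes(effects_along_lineage):
    """Return the change in a mutation's effect for each interval.

    effects_along_lineage holds the mutation's effect in each DBD, ordered
    from ancestor to descendant; each output value is child minus parent.
    """
    return [
        child - parent
        for parent, child in zip(effects_along_lineage, effects_along_lineage[1:])
    ]


# Hypothetical effects of one mutation in four successive DBDs.
effects = [-0.10, -0.25, -0.20, -0.60]
changes = epistatic_changes(effects)
```

A lineage of n DBDs yields n - 1 values, one per phylogenetic interval.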


Pervasive random changes in the effects 
of mutations 


To analyze the evolutionary dynamics of epista- 
sis over time, we adapted a classic quantitative 
framework for modeling trait evolution on 
phylogenies (26, 27), including the extent, 
direction, and rate of evolutionary change of 
the trait; the underlying genetic architecture; 
and the relative roles of selection and genetic 
drift. Our approach treats the phenotypic ef- 
fect of each mutation as a trait that changes 
probabilistically across phylogenetic intervals, 
allowing us to ask these questions about epi- 
static change during historical DBD evolution. 

Sixty percent of all mutations display sig- 
nificantly different effects among the nine back- 
grounds, and 22% differ in the direction of their 
effects [false discovery rate (FDR) < 0.1; Fig. 2A]. 


Most of the mutations that show no evidence 
of epistasis destroy protein function regard- 
less of genetic background (ΔF always at the 
lower bound of measurement, −1.3). Only 5% 
of mutations have a nondestructive effect that 
did not vary significantly across the phylogeny. 
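The FDR threshold used above can be illustrated with the Benjamini-Hochberg step-up procedure, which the Fig. 2 legend names as the correction applied. The sketch below is a generic implementation, not the authors' code; the function name and the p values are hypothetical.

```python
def benjamini_hochberg(p_values, fdr=0.1):
    """Return a list of booleans, True where the null hypothesis is rejected."""
    m = len(p_values)
    order = sorted(range(m), key=lambda i: p_values[i])
    # Find the largest rank k whose sorted p value satisfies p_(k) <= (k / m) * fdr.
    cutoff_rank = 0
    for rank, idx in enumerate(order, start=1):
        if p_values[idx] <= rank / m * fdr:
            cutoff_rank = rank
    # Reject every hypothesis ranked at or below the cutoff.
    rejected = [False] * m
    for idx in order[:cutoff_rank]:
        rejected[idx] = True
    return rejected


# Hypothetical per-mutation p values from a test of background dependence.
p_values = [0.001, 0.04, 0.03, 0.5, 0.012]
significant = benjamini_hochberg(p_values, fdr=0.1)
```

Because the procedure is step-up, a p value above its own rank threshold is still rejected if a larger p value meets its threshold.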

Epistatic changes occurred during all eight 
phylogenetic intervals (Fig. 2B and fig. S7A). 
Even in the shortest interval, during which 
there were only two sequence substitutions, 
the effects of more than 200 mutations changed 
significantly. During the other intervals, even 
more mutations changed in effect. On average, 
each substitution is associated with significant 
changes in the effects of about 60 mutations, 
about 4% of the total set of possible mutations 
(fig. S7B). 

These epistatic changes were unbiased over 
time. Changes in the effects of mutations (ΔΔF) 
are distributed almost symmetrically around 
0 (mean = -0.01; Fig. 2C). The fraction of 
mutations that reduce activity was nearly 
constant among the nine DBDs, as was the 
fraction of mutations that destroy activity (Fig. 
2D). No individual mutations had effects that 
changed with a significant bias in either direc- 
tion over time (fig. S7C). These data indicate 
that directional selection did not drive long- 
term epistatic changes in the effects of muta- 
tions, and mutational robustness did not change 
systematically over time. Further, the variance 
the library of cells into fluorescence bins, inferring the distribution of each 
mutant among bins by sequencing, and calculating the mean log10-GFP 
fluorescence (F). Hypothetical distributions for three variants with high, medium, 
and low F are shown (right). ERE, estrogen response element. (D) Tracing 
epistatic change across the phylogeny using example mutation SOP. The effect on 
each DBD's activity (points) was quantified as the change in mean log10-GFP 
fluorescence (ΔF). The x axis shows each DBD in order on the phylogeny, positioned 
by sequence divergence and colored by trajectory. ΔΔF represents the change in 
the mutation’s effect between a pair of DBDs, which is caused by epistatic 
interactions with intervening substitutions. Error bars represent SEM (n = 3). 
Dashed lines indicate upper and lower measurement bounds. 
of the distribution of ΔΔF in each interval 
increased linearly with sequence divergence, 
rather than plateauing (Fig. 2, E and F); this 
result suggests that stabilizing selection did 
not maintain the effects of mutations at inter- 
mediate values within defined limits. 


The effects of most mutations drifted gradually 


To test whether epistatic change was gradual 
or episodic, we fit probabilistic models of trait 
evolution to the trajectory of changes in the 
effect of each mutation. Brownian motion rep- 
resents a simple model of gradual evolution 
at a constant rate without directional bias: 
Changes in the trait value among phylogenetic 
intervals are normally distributed when nor- 
malized for the length of the interval, with a 
mean change of zero and constant variance per 
unit sequence divergence (which represents the 
rate of evolution). In the alternative model of 
episodic evolution, the normalized variance is 
a free parameter for each interval, which allows 
the rate to differ among intervals (Fig. 3A). 
We fit both models to the eight ΔΔF values of 
each mutation and used a likelihood-ratio test 
to compare the fit of the two models. 
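The model comparison described above can be sketched as follows. This is a simplified illustration, not the authors' implementation: it uses one change per interval (ignoring replicate measurements), fixes the mean change at zero in both models, and adds a hypothetical variance floor (`min_variance`) so that an interval with no observed change does not produce an undefined likelihood. Under these assumptions the statistic could be compared with a chi-square distribution with (number of intervals - 1) degrees of freedom, although that approximation is rough with so few observations.

```python
import math


def log_lik_zero_mean_normal(x, variance):
    """Log density of x under a normal distribution with mean 0."""
    return -0.5 * (math.log(2.0 * math.pi * variance) + x * x / variance)


def likelihood_ratio_statistic(ddf, divergence, min_variance=1e-6):
    """Return 2 * (logL_episodic - logL_gradual) for one mutation.

    ddf: change in the mutation's effect in each interval.
    divergence: length of each interval (sequence divergence).
    """
    n = len(ddf)
    # Gradual (null) model: the variance in interval i is rate * d_i.
    # The maximum-likelihood rate is the mean of ddf_i^2 / d_i.
    rate = sum(x * x / d for x, d in zip(ddf, divergence)) / n
    ll_gradual = sum(
        log_lik_zero_mean_normal(x, max(rate * d, min_variance))
        for x, d in zip(ddf, divergence)
    )
    # Episodic (alternative) model: each interval has its own variance,
    # whose maximum-likelihood estimate is ddf_i^2 (floored at min_variance).
    ll_episodic = sum(
        log_lik_zero_mean_normal(x, max(x * x, min_variance)) for x in ddf
    )
    return 2.0 * (ll_episodic - ll_gradual)


# Hypothetical trajectories (not data from the study).
steady = likelihood_ratio_statistic([0.1, -0.2], [1.0, 4.0])
bursty = likelihood_ratio_statistic([0.01, 0.01, 1.0], [1.0, 1.0, 1.0])
```

A trajectory whose squared changes scale exactly with interval length gives a statistic near zero, whereas one large change among otherwise small changes gives a large statistic.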
We found that the Brownian motion model 
was the best-fit model for 92% of mutations 
that changed epistatically (Fig. 3, B and C), 
irrespective of whether the effects of the mu- 
tations changed rapidly or slowly (Fig. 3D). For 
the 8% of mutations best fit by the episodic 
model, effects were nearly constant in most 
intervals, with substantial changes during one 
or a few intervals. The functional effects of 
most mutations therefore evolved as a random 
variable that changes gradually along the 
phylogeny at a characteristic rate and without 
bias. We call this process epistatic drift. 

Phylogenetic cross-validation confirmed that 
the effects of most mutations evolved at a 
steady rate across the phylogeny (Fig. 3E). 
For each mutation, we predicted the epistatic 
change expected in each of the eight intervals 
given the rate of epistatic change estimated 
from the seven other intervals; we then com- 
pared these predictions with experimental 
observations, pooling mutations with similar 
estimated rates (Fig. 3F). Predicted and observed 
epistatic changes were strongly correlated 
(Spearman’s ρ = 0.94 for every interval; Fig. 
3G), indicating that the mutations’ relative 
rates of epistatic change did not strongly vary 
along the phylogeny. The absolute rate of total 
epistatic change, however, was systematically 
faster than predicted in some intervals and 
slower in others: The mean rate of epistatic 
change for all mutations in each interval 
ranges from 0.7 to 1.4 of the average across 
the phylogeny (Fig. 3H). Epistatic change in 
the effect of each mutation therefore varies 
stochastically across intervals (consistent with 
Brownian motion), but this variation is cor- 
related among mutations; as a result, the total 
amount of epistatic change across all muta- 
tions is systematically greater in some inter- 
vals than in others. This pattern is likely to arise 
because the total epistatic change depends on 
the particular substitutions that fixed during 
an interval, and some substitutions are more 
epistatic than others because they interact more 
strongly or with a larger number of mutations. 
The mean rate of epistatic change was not 
systematically different during intervals that 
followed gene duplications. 
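The cross-validation scheme described above can be sketched as a leave-one-interval-out calculation. The estimator below, which takes the rate to be the mean squared change per unit divergence across the other intervals, is an assumption made for illustration; the function names and values are hypothetical.

```python
def leave_one_out_rates(ddf, divergence):
    """For each interval, estimate the rate of epistatic change from the others.

    The rate is taken as the mean of ddf_j^2 / d_j over all intervals j
    except the held-out interval.
    """
    n = len(ddf)
    unit_squared = [x * x / d for x, d in zip(ddf, divergence)]
    total = sum(unit_squared)
    return [(total - u) / (n - 1) for u in unit_squared]


def predicted_variances(ddf, divergence):
    """Predicted variance of the change in each held-out interval (rate * d)."""
    rates = leave_one_out_rates(ddf, divergence)
    return [r * d for r, d in zip(rates, divergence)]


# Hypothetical changes in one mutation's effect across three intervals.
ddf = [0.1, 0.2, 0.3]
divergence = [1.0, 1.0, 1.0]
rates = leave_one_out_rates(ddf, divergence)
```

Pooling mutations with similar estimated rates, as described in the text, would then allow the predicted variance to be compared with the observed variance of changes in the held-out interval.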

These observations have two major implica- 
tions for evolution and the genetic architecture 
of epistatic interactions (Fig. 3I). First, epistatic 
Fig. 2. Pervasive random changes in the effects of mutations. (A) Maximum 
and minimum effect of each mutation (points) across the nine DBDs, colored 
according to the stacked column on the right, which shows the proportion 
of mutations in four categories: magenta, significant effect of DBD background 
on ΔF and the sign of ΔF different between the maximum and minimum; red, 
significant effect of background but no sign difference; black, no significant 
effect of background and ΔF within measurement limits; and cyan, ΔF at the 
lower bound of measurement in all nine DBDs. Significance was evaluated by 
Welch's analysis of variance, Benjamini-Hochberg FDR < 0.1. Dashed lines 
indicate measurement bounds. (B) Number of mutations in each phylogenetic 
interval that changed significantly in ΔF (heteroscedastic t test between parent 
and child node, FDR < 0.1) plotted against the number of amino acids that 
diverged in the interval. (C) Distribution of epistatic change in the effect of 
every mutation during every phylogenetic interval (ΔΔF). Dark gray indicates a 
ΔΔF significantly different from zero. Mutations that were always at the 
lower bound of measurement were excluded. (D) Fraction of mutations in each 
DBD with a ΔF less than zero (circles) or a ΔF at the lower bound (L.B.) of 
measurement (triangles). (E) Distribution of ΔΔF of all mutations for the 
protostome-annelid interval (left) or the AncSR1-human GR interval (right). 
The variance of the distribution (Var) quantifies the total epistatic change in the 
effects of all mutations during an interval. d, sequence divergence. (F) Total 
epistatic change as a function of sequence divergence across the phylogeny. 
Red dots represent each of the eight independent phylogenetic intervals 
between characterized DBDs; black dots represent all composite intervals. 
Dashed lines indicate the best-fit power function for all (black) or the 

eight independent (red) intervals. 


20 MAY 2022 * VOL 376 ISSUE 6595 825 




[Fig. 3 graphic, panels A to I. Recoverable labels: gradual (null) and episodic (alternative) models; sequence divergence; normalized ΔΔF; example mutations M64W (p = 0.73), S17T (p = 0.46), K64I (p < 0.01), and S42T (p < 0.01); predicted versus observed epistatic change (Var[ΔΔF]); gradual or episodic and rapid or slow categories.] 


Fig. 3. Effects of most mutations changed gradually at characteristic 
rates. (A) Models of the tempo of epistatic change. In the null model, the 
amount of change in a mutation’s effect per substitution in an interval (unit ΔΔF) 
is randomly drawn from a normal distribution centered at 0; the variance 
is the same among intervals, so the mutation’s effect changes gradually at 
a constant expected rate as substitutions accrue. In the alternative model, 
the variance may differ among phylogenetic intervals (cyan versus red), 
leading to episodic changes in a mutation’s effect. Seq. div., sequence 
divergence. (B) Distribution of the p value of the likelihood-ratio test comparing 
gradual and episodic models for each mutation. Darker gray indicates 
mutations for which the gradual model is rejected (FDR < 0.2). Mutations 
that were always at the lower bound of measurement were excluded from 
this analysis. (C) Distribution of the normalized amount of epistatic change in 
each interval, for all mutations better fit by the gradual model (left) or the 
episodic model (right). Normalized ΔΔF is the ΔΔF of a mutation in an interval 
divided by σd^(1/2), where σ is the mutation’s average rate of epistatic change 
and d is the length of the interval. Gray columns are observed data; the red line 
shows the distribution expected under the null model. (D) Trajectory of 
changes in the effect of two sets of example mutations that are better fit 
by the gradual model (left) or the episodic model (right); in each category, 
one evolves rapidly and the other slowly. Each mutation’s p value in the 
likelihood-ratio test is shown. The gray box shows normalized ΔΔF across 
each of the eight intervals. Dashed lines indicate measurement bounds. 
(E) Phylogenetic cross-validation. In the example shown, ΔΔF in interval 1 is 
interactions within the DBD are dense: The 
effects of most mutations changed gradually 
because of weak interactions with many substi- 
tutions, and each substitution typically modi- 
fied the effects of many mutations (Fig. 2B). If 
most epistatic changes were triggered by rare, 
large-effect modifiers, the distribution of ΔΔF 
would be enriched near zero and at extreme 
values, a pattern that we observed for only 
a small fraction of mutations. Most historical 
contingency is therefore the cumulative result 
of many small-effect epistatic modifications. 
Against this background of gradual epistatic 
drift, a few mutations occasionally undergo sub- 
stantial changes in their effects. 

Second, some mutations are more epistatic- 
ally sensitive than others, with effects that 
diverged more rapidly as substitutions accu- 
mulated. Conversely, some substitutions are 
more epistatic than others, changing the effects 
of more target mutations or causing changes 
of greater magnitude. As a result, there are sys- 
tematic differences among intervals in the aver- 
age rate of epistatic change across all mutations. 


Mutations vary in memory length and the time 
scale of contingency 


Because the effect of each mutation drifts at ran- 
dom at a steady rate, there should be a charac- 
teristic time period after which the mutation’s 
effect can no longer be reliably predicted from 
its known effect at some other time. We call 
this period the mutation’s memory length, the 
measure of which is the memory half-life—the 
amount of sequence divergence over which 
the correlation of a mutation’s effect with its 
initial effect is reduced by half. To estimate 
memory half-life, we partitioned mutations into 
deciles by their rate of epistatic change and cal- 
culated for each decile how correlated the effects 
of mutations are between each pair of DBDs 
(Fig. 4, A to D). We modeled the correlation co- 
efficient as an exponentially decaying function 
of sequence divergence. We then used this rela- 
tionship to estimate the memory half-life of each 
mutation from its rate of epistatic change. 
Reflecting the wide variation in the rate of 
epistatic change among mutations, memory 
half-lives range from just 3% sequence divergence 
to virtually infinite (Fig. 4E). Mutations with 
predicted from the average rate of epistatic change measured across intervals 
2 to 8 (gray box). (F) Distribution of observed ΔΔF during interval 1 (gray 
columns) and predicted by cross-validation (red line). Mutations were grouped 
into deciles by their rate of epistatic change across intervals 2 to 8; predictions 
are shown for deciles with the slowest, median, or fastest rates. (G) The 
relative rates of epistatic change for mutations are consistent across 
phylogenetic intervals. Points represent deciles of mutations grouped by the 
predicted rate of epistatic change; observed epistatic change is plotted against 
that predicted by cross-validation. The dashed line indicates the linear 
regression. (H) Among-interval differences in average rate of epistatic change. 
Each column shows the mean rate of epistatic change of all mutations in 
one phylogenetic interval, normalized so that the mean across all intervals 


the shortest half-lives therefore forget the 
effects they had in the past after just a few 
sequence substitutions at other sites. At any 
moment, their effect and likely fate depend 
primarily on the substitutions that occurred 
most recently during their history. 
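The exponential-decay model of correlation described above implies a simple relation between the decay constant and the memory half-life. The sketch below fits the decay constant by least squares on the log scale, with the curve forced through a correlation of 1 at zero divergence. The fitting method, the function names, and the values are assumptions for illustration and are not taken from the study.

```python
import math


def fit_decay_constant(divergence, correlation):
    """Fit k in correlation = exp(-k * d) by least squares on log(correlation).

    Requires every correlation to be positive.
    """
    numerator = sum(d * math.log(r) for d, r in zip(divergence, correlation))
    denominator = sum(d * d for d in divergence)
    return -numerator / denominator


def memory_half_life(decay_constant):
    """Sequence divergence at which the correlation has fallen by half."""
    return math.log(2.0) / decay_constant


# Hypothetical correlations for DBD pairs at 10, 20, and 42% divergence.
divergence = [10.0, 20.0, 42.0]
correlation = [math.exp(-0.02 * d) for d in divergence]
k = fit_decay_constant(divergence, correlation)
half_life = memory_half_life(k)
```

With a decay constant of 0.02 per percent divergence, the half-life in this example is about 35% sequence divergence.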

Relative to the time scale of DBD evolution, 
about one-fourth of all mutations have short 
memories (half-life < 50% sequence divergence); 
in this group, the effects in present-day human 
GR are almost completely independent of their 
initial effects in AncSR (r² = 0.10; Fig. 4F). 
Twenty percent of mutations have medium 
memory (half-life 50 to 200%), with present- 
day effects that can be partially predicted from 
their initial effects (r² = 0.68). The remaining 
54% of mutations have long memories (>200% 
divergence) and interacted negligibly with his- 
torical substitutions, retaining their initial ef- 
fects throughout DBD evolution (r² = 0.98). 
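The three memory categories above can be expressed as a small classifier, together with the fraction of the initial correlation expected to remain after a given divergence under exponential decay. The treatment of values exactly at the 50% and 200% boundaries is an assumption, as are the function names.

```python
def memory_category(half_life_percent):
    """Classify a mutation by memory half-life (percent sequence divergence)."""
    if half_life_percent < 50:
        return "short"
    if half_life_percent <= 200:
        return "medium"
    return "long"


def correlation_retained(divergence_percent, half_life_percent):
    """Fraction of the initial correlation remaining after the given divergence."""
    return 0.5 ** (divergence_percent / half_life_percent)


# Hypothetical half-lives for three mutations.
categories = [memory_category(h) for h in (30, 120, 500)]
```

By construction, a mutation retains half of its initial correlation after a divergence equal to its half-life, and one quarter after twice that divergence.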


Contingency of historical sequence evolution 


We next focused on the subset of mutations that 
became substitutions during historical DBD 
evolution. We first assessed the functional effects 
of the 79 substitutions that occurred during the 
phylogenetic intervals that we experimentally 
characterized (Fig. 5, A and B). When measured 
in the ancestral background in which they his- 
torically occurred, substitutions that reduce 
activity by a ΔF less than −0.2 were nearly 
absent; of the few exceptions, most fixed during 
intervals immediately after gene duplication 
(fig. S8A). This represents a 29-fold depletion 
compared with the set of all mutations, most 
of which have a ΔF less than −0.2. These 
results imply that the DBD evolved primarily 
under purifying selection against mutations 
that strongly reduce activity, and they estab- 
lish a ΔF of −0.2 as a boundary that roughly 
defines the evolutionary accessibility of muta- 
tions under purifying selection. 

Epistasis shaped the fate of most historical 
substitutions, which occurred during limited 
windows when they were transiently acces- 
sible. Of all substitutions that fixed between 
AncSR and any extant steroid receptor on our 
phylogeny, 43% have short or medium mem- 
ories (Fig. 5C). Among the short-memory sub- 
stitutions, most were inaccessible in AncSR 


equals one. Error bars represent the estimated standard deviation obtained 
by bootstrap-resampling of mutations. Asterisks indicate the intervals 
immediately following gene duplication. (I) Inferring the architecture of 
epistatic interactions between substitutions (subs; black boxes) and a focal 
mutation (star) from phylogenetic DMS. Gradual changes in the mutation’s 
effect during evolution arise if many substitutions act as epistatic modifiers 
(arrows, with thickness showing the strength of interaction), yielding a normal 
distribution of ΔΔF per substitution (left). Episodic changes arise from 
interactions with only a few substitutions, yielding a distribution heavy at 
zero and the tails (right). In either case, strong versus weak interactions 
cause rapid (top) versus slow (bottom) epistatic change. The fraction of all 
mutations in each category in our experiments is shown. 


(ΔF_AncSR < −0.2), implying that they became 
accessible in one or more descendant proteins 
because of permissive epistatic substitutions, 
which render otherwise deleterious mutations 
neutral or advantageous. The remaining short- 
memory substitutions were accessible in 
AncSR (ΔF_AncSR ≥ −0.2), but almost all of these 
became subsequently inaccessible because 
of restrictive substitutions, which render pre- 
viously neutral or advantageous mutations 
deleterious (fig. S8B). By contrast, 95% of long- 
memory substitutions were accessible in AncSR 
and remained so across the entire phylogeny 
(fig. S8B). Medium-memory substitutions dis- 
played an intermediate pattern. The evolu- 
tionary fate of long-memory substitutions could 
therefore have been reliably predicted from 
their initial effects, but the accessibility of 
substitutions with short or medium memory 
depended on other substitutions that occurred 
during history. 

Epistasis also shaped the fate of the many 
mutations that did not become substitutions. 
Of all short-memory mutations that were 
accessible in AncSR, 90% became inaccessible 
in one or more descendant proteins, indicating 
that evolutionary paths to them were closed by 
restrictive substitutions (Fig. 5D). Conversely, 
55% of the short-memory mutations that were 
inaccessible in AncSR subsequently became 
accessible because of permissive substitutions. 
Overall, two-thirds of short-memory mutations 
and one-third of medium-memory mutations 
changed in accessibility among the DBDs we 
tested; each category of mutations was acces- 
sible in an average of 2.4 and 4.9 of the 9 DBDs, 
respectively (Fig. 5E). 
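The accessibility bookkeeping described above can be sketched with the threshold established earlier (an effect of at least -0.2 counts as accessible). The function names and the example effects are hypothetical, and "changed in accessibility" is interpreted here as being accessible in some but not all of the DBDs tested.

```python
ACCESSIBILITY_THRESHOLD = -0.2  # effect at or above this value counts as accessible


def is_accessible(delta_f, threshold=ACCESSIBILITY_THRESHOLD):
    """True if the mutation's effect is at or above the accessibility threshold."""
    return delta_f >= threshold


def accessibility_summary(effects_by_dbd):
    """Count the DBDs in which a mutation is accessible and flag any change."""
    flags = [is_accessible(x) for x in effects_by_dbd]
    n_accessible = sum(flags)
    return {
        "n_accessible": n_accessible,
        # Accessible in some, but not all, backgrounds.
        "changed": 0 < n_accessible < len(flags),
    }


# Hypothetical effects of one mutation in nine DBDs.
effects = [0.0, -0.1, -0.5, -1.3, -0.2, 0.1, -0.3, -0.25, 0.0]
summary = accessibility_summary(effects)
```

Averaging `n_accessible` over all mutations in a memory category would give the per-category means reported in the text.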

These data indicate that epistatic interac- 
tions with the particular set of substitutions 
that occurred along the phylogeny contingently 
determined the evolutionary fate not only of 
the mutations that fixed historically because of 
permissive substitutions but also of those that 
did not have the opportunity to fix because of 
restrictive substitutions. Studying only the se- 
quence changes that occurred during evolution 
therefore underestimates the role of historical 
contingency: Doing so cannot detect the many 
evolutionary roads that were closed off con- 
tingently but which could have been taken if 
Fig. 4. Memory length of mutations and the time scale of historical contingency. 
[...] mutations. In (A), mutations 
were grouped into deciles by their rate of epistatic change (σ, expected standard 
deviation of ΔΔF per 1% sequence divergence). In (B), the effects of mutations in 
each decile were compared between every pair of DBDs; shown are comparisons 
between AncSR and human GR (42% divergence). In (C), the r² for each DBD 
pair was plotted against the sequence divergence of that pair. The dotted line 
[...] memory half-life is the sequence [...] 
the trajectory of sequence changes at interact- 
ing sites had unfolded differently. 


Causes of variation in memory length 


Finally, we sought to identify the factors that 
determine a mutation’s memory length. Some 
variation in memory length is attributable to 
the sequence site at which a mutation occurs: 
The median memory half-life of mutations 
to any of the 19 mutant states at the same 
site varies among sites from 11 to >200% 
divergence (Fig. 6A). But this variation is not 
associated with any obvious structural or func- 
tional properties: The median memory half-life 
of a site is poorly correlated with relative 
solvent accessibility, rate of substitution, rate 
of substitution at physically adjacent sites, dis- 
tance to the DNA-binding residues, and dis- 
tance to the dimerization interface (r² < 0.1 for 
every factor; fig. S9). Further, memory length 
varies extensively within each site, with 59 of 
76 sites in the DBD containing both short- 
and long-memory mutations. As a consequence, 
predicting a mutation’s memory half-life by 
the median of all mutations at that site achieves 
an r² of only 0.25 (Fig. 6B). 

Another possibility is that certain types of 
mutations (to and from the same pair of states) 
might be consistently associated with memory 
length, irrespective of the sites at which they 
occur. Predicting the memory length of in- 
dividual mutations from the median memory 
length of all mutations of the same type at 
any site achieved an r² of only 0.13 (Fig. 6C). 
Explaining memory length variation therefore 
requires analysis of each particular mutation 
at each site in the protein. 
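The site-median predictor evaluated above can be sketched as follows. The data layout (a dictionary keyed by site and mutant amino acid), the function names, and the values are hypothetical; very long half-lives would need to be capped or transformed before use, which is not shown here.

```python
from collections import defaultdict
from statistics import median


def pearson_r_squared(x, y):
    """Squared Pearson correlation between two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    sxy = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    sxx = sum((a - mean_x) ** 2 for a in x)
    syy = sum((b - mean_y) ** 2 for b in y)
    return sxy * sxy / (sxx * syy)


def r_squared_of_site_median(half_life):
    """Predict each mutation's half-life by its site median and score the fit.

    half_life maps (site, mutant_amino_acid) to a memory half-life.
    """
    by_site = defaultdict(list)
    for (site, _amino_acid), value in half_life.items():
        by_site[site].append(value)
    site_median = {site: median(values) for site, values in by_site.items()}
    keys = list(half_life)
    observed = [half_life[key] for key in keys]
    predicted = [site_median[key[0]] for key in keys]
    return pearson_r_squared(predicted, observed)


# Hypothetical half-lives (percent divergence) at two sites.
half_life = {(1, "A"): 10, (1, "G"): 30, (2, "A"): 100, (2, "G"): 300}
r2 = r_squared_of_site_median(half_life)
```

The same function applies to the mutation-type predictor if the grouping key is changed from the site to the pair of wild-type and mutant amino acids.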

To estimate a mutation’s memory length, 
experiments are necessary in multiple genetic 
backgrounds across a phylogenetic trajectory. 
But how many backgrounds are necessary? 
When the rate of epistatic change of mutations 
is estimated from two backgrounds that were 
randomly chosen from the nine we assayed, the 
epistatic change and memory half-life inferred by fitting a power function to the 
mean rate of epistatic change and memory half-life of the deciles is shown in 
(D). This relationship was used to calculate the memory half-life of each mutation 
from its rate of epistatic change. (E) Distribution of memory half-life among 
mutations. Mutations were classified into short-, medium-, and long-memory 
[...] cutoffs of 50 and 200% divergence. (F) Comparing the effects of 
[...] AncSR and human GR (42% divergence) for each memory 
[...] are mutations with significant difference in ΔF (heteroscedastic 
[...] black dots are those with no significant difference. 


correlation (r²) with the rate measured using 
all nine backgrounds is, on average, equal to 
0.40, and the rate of epistatic change is sys- 
tematically underestimated (Fig. 6, D and E). 
The correlation improves as more backgrounds 
are sampled and reaches 0.8 when estimates 
are based on five backgrounds. A moderate 
number of experiments is therefore sufficient 
to provide a rough estimate of a mutation’s rate 
of epistatic change and hence its memory length. 


Robustness to uncertainty in ancestral 
sequence reconstruction 


Our conclusions are robust to uncertainties in 
the reconstruction of ancestral sequences. The 
ancestral DBDs were generally inferred with 
high confidence and contained zero to 10 sites 
at which more than one amino acid state is 
plausible. For each ancestral DBD with more 
than one ambiguous site, we generated an “Alt- 
All” reconstruction, which contains the alter- 
native plausible amino acid at all ambiguously 
Fig. 5. Impact on sequence evolution of memory length and initial 
functional effect. (A) The effect of a substitution at the time it fixed 
during history was calculated as the mean of ΔFs measured by DMS in the 
nearest ancestral and descendant nodes. (B) Comparing the effects of 
the 79 substitutions that occurred along the phylogenetic trajectories we 
characterized to the effects of all possible mutations. Substitutions are 
29-fold enriched for a ΔF greater than or equal to −0.2 compared with 
mutations, providing an estimate of the threshold of accessibility during 
DBD evolution (dashed line). (C) Distribution of the initial effect (ΔF on 
AncSR) of 275 substitutions that fixed between AncSR and any extant DBD 


reconstructed sites (28); this sequence repre- 
sents the least likely of all plausible recon- 
structions and allows a conservative estimate 
of robustness to sequence uncertainty. We then 
constructed a complete DMS library of each 
Alt-All protein and repeated all of our exper- 
iments and analyses. Although the effects of 
some mutations differ between the Alt-All and 
maximum-posterior-probability reconstruc- 
tions, all conclusions concerning the temporal 
dynamics of epistasis were unchanged (fig. S10). 


Discussion 


Prior experimental studies have identified cases 
in which the functional effects of a few muta- 
tions changed substantially during particular 
intervals of evolutionary history (12, 15-18, 29, 30). 
Our observations show that such rare, large- 
effect epistatic modifications occur against a 
background of pervasive gradual drift in the 
effects of most mutations (11, 20, 31-36). Most 
epistatic changes were of small magnitude 
when they occurred, but across an evolutionary 
trajectory of moderate length (<50% sequence 
divergence), they were sufficient to completely 
or partially decorrelate the effects of most mu- 
tations from their initial effects and substan- 
all, DBDs is shown. 


tially alter the set of available opportunities for 
future sequence change. The fold and function 
of all proteins depend on interactions among 
many residues, so we expect that epistatic drift 
will be a widespread feature of protein evolu- 
tion. The temporal dynamics and distribution of 
memory lengths may depend on each protein’s 
structure, function, and the selective regime 
under which it evolved. 

Our findings establish strong limits on the 
ability to predict future evolution and interpret 
evolutionary history, but they also provide a 
quantitative framework for understanding 
those limits. Classical evolutionary theories 
assume that the constraints imposed by puri- 
fying selection do not change as sequences 
diverge, so the effects and evolutionary fate 
of mutations can be predicted or retroactively 
inferred based on their effects measured in the 
present. Our results show that this assumption 
of constancy and independence is wrong for 
about half of DBD mutations, which have 
short or medium memories. Because epistatic 
modification occurs at a mostly constant rate 
for each mutation, however, an estimate of 
memory length from experimental data across 
phylogenetic time can quantify the extent to 


in our phylogeny. Distributions are shown by memory half-life category. 
Enrichment of substitutions with a ΔF greater than or equal to −0.2 
(dashed line) relative to mutations is shown. (D) The proportion of initially 
accessible mutations (ΔF_AncSR ≥ −0.2) that become inaccessible in at 
least one descendant DBD is shown on the left. The proportion of initially 
inaccessible mutations that become accessible in at least one descendant DBD 
is shown on the right. (E) Distribution of the number of characterized DBDs in 
which each mutation is accessible (ΔF ≥ −0.2), classified by memory-length 
category. The percentage of mutations that were accessible in some, but not 


which any mutation’s effect can be predicted 
at any point in time, either future or past. 
Further, although point projections of the 
effects of short- and medium-memory muta- 
tions across long time scales are unreliable, a 
probability distribution of those effects can be 
generated if we know any mutation’s memory 
length and its effect at some other time. 
Ancestral protein reconstruction can replace 
predictions with experimental knowledge, but 
only for proteins in the past. 

A probabilistic description of contingency
and uncertainty using memory length does
not require detailed knowledge of the particular
genetic interactions that cause epistatic
change. If we had microscopic knowledge of
all the interactions that modify each mutation’s
effect and a dense phylogenetic reconstruction
of past trajectories of sequence change, we
could reliably predict the effect of every possible
mutation in any past historical background.
But even complete knowledge like this would
not be sufficient to predict future evolutionary
trajectories: the accessibility of each future mutation
depends on the chain of epistatic substitutions
that occur before it, many of which will
occur by chance. We can use experimental and
Fig. 6. Variation of memory half-life of mutations among and within
sites. (A) Distribution of memory half-life among sites. Each line shows the 
range of memory half-life of all mutations at one site in the DBD sequence. 
(B and C) Predicting the memory half-life of a mutation (points) by the 
median memory half-life of all possible mutations at the same site (B) or 
by the median of mutations of the same type (between the same wild-type 
and mutant amino acid) at all sites (C). Dashed lines indicate the linear 


phylogenetic data to tame evolutionary uncertainty
by recognizing and quantifying it, but
the future will always surprise.


REFERENCES AND NOTES 


1. T. N. Starr, J. W. Thornton, Protein Sci. 25, 1204-1218 (2016).
2. Z. D. Blount, R. E. Lenski, J. B. Losos, Science 362, eaam5979 (2018).
3. D. Melamed, D. L. Young, C. E. Gamble, C. R. Miller, S. Fields, RNA 19, 1537-1551 (2013).
4. C. A. Olson, N. C. Wu, R. Sun, Curr. Biol. 24, 2643-2651 (2014).
5. A. I. Podgornaia, M. T. Laub, Science 347, 673-677 (2015).
6. K. S. Sarkisyan et al., Nature 533, 397-401 (2016).
7. N. C. Wu, L. Dai, C. A. Olson, J. O. Lloyd-Smith, R. Sun, eLife 5, e16965 (2016).
8. G. Diss, B. Lehner, eLife 7, e32472 (2018).
9. V. O. Pokusaeva et al., PLOS Genet. 15, e1008079 (2019).
10. A. S. Kondrashov, S. Sunyaev, F. A. Kondrashov, Proc. Natl. Acad. Sci. U.S.A. 99, 14878-14883 (2002).
11. M. Lunzer, G. B. Golding, A. M. Dean, PLOS Genet. 6, e1001162 (2010).
12. C. Natarajan et al., Science 340, 1324-1327 (2013).
13. M. B. Doud, O. Ashenberg, J. D. Bloom, Mol. Biol. Evol. 32, 2944-2960 (2015).
14. H. K. Haddox, A. S. Dingens, S. K. Hilton, J. Overbaugh, J. D. Bloom, eLife 7, e34420 (2018).
15. E. A. Ortlund, J. T. Bridgham, M. R. Redinbo, J. W. Thornton, Science 317, 1544-1548 (2007).
16. J. T. Bridgham, E. A. Ortlund, J. W. Thornton, Nature 461, 515-519 (2009).
17. J. D. Bloom, L. I. Gong, D. Baltimore, Science 328, 1272-1275 (2010).
18. L. I. Gong, M. A. Suchard, J. D. Bloom, eLife 2, e00631 (2013).
19. C. Natarajan et al., Science 354, 336-339 (2016).


830 20 MAY 2022 « VOL 376 ISSUE 6595 


20. T. N. Starr, J. M. Flynn, P. Mishra, D. N. A. Bolon, J. W. Thornton, Proc. Natl. Acad. Sci. U.S.A. 115, 4453-4458 (2018).
21. M. Karageorgi et al., Nature 574, 409-412 (2019).
22. V. C. Xie, J. Pu, B. P. H. Metzger, J. W. Thornton, B. C. Dickinson, eLife 10, e67336 (2021).
23. G. K. Whitfield, P. W. Jurutka, C. A. Haussler, M. R. Haussler, J. Cell. Biochem. 33, 110-122 (1999).
24. C. Helsen et al., Mol. Cell. Endocrinol. 348, 411-417 (2012).
25. A. N. McKeown et al., Cell 159, 58-68 (2014).
26. P. H. Harvey, M. D. Pagel, The Comparative Method in Evolutionary Biology (Oxford Series in Ecology and Evolution, Oxford Univ. Press, 1991).
27. L. J. Harmon, Phylogenetic Comparative Methods: Learning From Trees (CreateSpace, 2018).
28. G. N. Eick, J. T. Bridgham, D. P. Anderson, M. J. Harms, J. W. Thornton, Mol. Biol. Evol. 34, 247-261 (2017).
29. Z. D. Blount, C. Z. Borland, R. E. Lenski, Proc. Natl. Acad. Sci. U.S.A. 105, 7899-7906 (2008).
30. M. J. Harms, J. W. Thornton, Nature 512, 203-207 (2014).
31. I. S. Povolotskaya, F. A. Kondrashov, Nature 465, 922-926 (2010).
32. M. S. Breen, C. Kemena, P. K. Vlasov, C. Notredame, F. A. Kondrashov, Nature 490, 535-538 (2012).
33. D. D. Pollock, G. Thiltgen, R. A. Goldstein, Proc. Natl. Acad. Sci. U.S.A. 109, E1352-E1359 (2012).
34. P. Shah, D. M. McCandlish, J. B. Plotkin, Proc. Natl. Acad. Sci. U.S.A. 112, E3226-E3235 (2015).
35. Z. R. Sailer, M. J. Harms, Proc. Natl. Acad. Sci. U.S.A. 114, 11938-11943 (2017).
36. A. J. Morrison, D. R. Wonderlick, M. J. Harms, Genetics 219, iyab105 (2021).


ACKNOWLEDGMENTS


We thank the members of the Thornton laboratory for discussion
and comments, R. Ranganathan and R. Kanan for assistance with


[Fig. 6, panels C to E. (C) x-axis: mutation-type median; r = 0.13. (D and E) x-axis: number of backgrounds.]


regression. (D and E) Effect of the number of DBDs characterized by DMS on 
estimates of rate of epistatic change. The rate of epistatic change of every 
mutation was estimated using a subset of the nine DMS experiments; the 
relationship between the estimated rate from each subset to that estimated 
from all nine experiments was analyzed by linear regression. The graphs 
show the distribution of correlation coefficient (D) and best-fit regression 
slope (E) across every possible subset of a given size. 


Illumina sequencing, the University of Chicago Cytometry and
Antibody Technology Facility (RRID: SCR_017760) for assistance
with fluorescence-activated cell sorting, and the University of
Chicago Research Computing Center for high-performance
computing. Funding: This work was funded by National Institutes
of Health (NIH) grants R01GM131128 (J.W.T.), R01GM121931
(J.W.T.), R01GM139007 (J.W.T.), and F32GM122251 (B.P.H.M.);
and a Samsung Scholarship (Y.P.). Author contributions:
Conceptualization: Y.P., B.P.H.M., J.W.T.; Experiments: Y.P.; Data
analysis: Y.P.; Development of methodology: Y.P., J.W.T., B.P.H.M.;
Writing - original draft: Y.P., J.W.T.; Writing - review and editing:
Y.P., J.W.T., B.P.H.M.; Funding acquisition: J.W.T. Competing
interests: The authors declare that they have no competing
interests. Data and materials availability: All data and scripts are
available in the main text or the supplementary materials. A
duplicate is also available on GitHub (https://github.com/JoeThorntonLab).
License information: Copyright © 2022 the
authors, some rights reserved; exclusive licensee American
Association for the Advancement of Science. No claim to original
US government works. https://www.science.org/about/science-licenses-journal-article-reuse


SUPPLEMENTARY MATERIALS 


science.org/doi/10.1126/science.abn6895 
Materials and Methods 

Figs. S1 to S10 

Tables S1 and S2 

References (37-54) 

MDAR Reproducibility Checklist 

Data S: 


Submitted 14 December 2021; accepted 11 April 2022 
10.1126/science.abn6895 


science.org SCIENCE 


RESEARCH | RESEARCH ARTICLES 


STRUCTURAL BIOLOGY 


Structures of Tetrahymena’s respiratory chain reveal 
the diversity of eukaryotic core metabolism 


Long Zhou, Maria Maldonado, Abhilash Padavannil, Fei Guo, James A. Letts


Respiration is a core biological energy-converting process whose last steps are carried out by a chain
of multisubunit complexes in the inner mitochondrial membrane. To probe the functional and structural
diversity of eukaryotic respiration, we examined the respiratory chain of the ciliate Tetrahymena
thermophila (Tt). Using cryo-electron microscopy on a mixed sample, we solved structures of a
supercomplex between Tt complex I (Tt-CI) and Tt-CIII2 (Tt-SC I+III2) and a structure of Tt-CIV2. Tt-SC
I+III2 (~2.3 megadaltons) is a curved assembly with structural and functional symmetry breaking. Tt-CIV2
is a ~2.7-megadalton dimer with more than 50 subunits per protomer, including mitochondrial carriers
and a TIM83-TIM133-like domain. Our structural and functional study of the T. thermophila respiratory
chain reveals divergence in key components of eukaryotic respiration, thereby expanding our
understanding of core metabolism.


Cellular respiration is an essential metabolic process that provides chemical energy,
in the form of adenosine triphosphate (ATP), to all free-living eukaryotes. The
final energy-converting steps of cellular respiration are carried out by the mitochondrial
oxidative phosphorylation electron transport chain (mETC), which transfers electrons from
reduced metabolites such as NADH (reduced nicotinamide adenine dinucleotide) or succinate
to the final electron acceptor, O2. The mETC, located in the inner mitochondrial membrane
(IMM), is composed of four protein complexes: NADH-ubiquinone (Q) oxidoreductase (complex I,
CI), succinate dehydrogenase (complex II, CII), dimeric ubiquinol (QH2)-cytochrome c (cyt c)
oxidoreductase (complex III2, CIII2), and cytochrome c oxidase (complex IV, CIV). By coupling
the energetically favorable electron transfer reactions to H+ pumping across the
IMM, the mETC complexes CI, CIII2, and CIV establish a proton motive force that is used by
the ATP synthase complex (complex V, CV) to synthesize ATP (1). In all eukaryotes investigated,
the mETC complexes are found both individually and as parts of higher-order supercomplexes
(SCs) of defined stoichiometry (2), including SC I+III2, SC I+III2+IV, and SC
III2+IV(2) (3). CIV and CV are also commonly found as dimers (CIV2 and CV2) (4, 5) or, in the
case of CV, higher-order arrays (6-9).
Although the catalytic mechanisms of the individual mETC complexes are likely conserved
across species, some structural differences have been observed between model
organisms. Each mETC complex can be split into core subunits, which are essential for
catalysis and are conserved from the bacterial homologs, and accessory subunits, which
are important for the assembly and activity of the complexes but are not universally
conserved (10, 11). CI is composed of 14 core subunits, arranged into two “arms” of an “L”-shape,
universally conserved in the model eukaryotes with known structures, covering the
Opisthokont [fungi (12) and metazoan (13)] and Archaeplastida [red/green algae and plants
(14-16)] clades. However, each structure of CI has a variable set of 31 to 37 accessory
subunits, some of which are shared across all known examples of eukaryotic CI and some
of which are only found in a specific branch of the eukaryotic tree of life (table S1) (17).
For the other H+-pumping complexes of the mETC (CIII2 and CIV), much less compositional
variability has been observed between the characterized examples from opisthokonts
(4, 18, 19) and archaeplastidans (20) (tables S2 and S3). Although the biochemical and
structural characterization of eukaryotic CI, CIII2, and CIV has been limited to model organisms
of the Opisthokonta and Archaeplastida clades, mitochondrial CV has been structurally
characterized from a broader array of eukaryotes, including not only opisthokonts (5, 21) and
archaeplastidans (22), but also kinetoplastids (23) and euglenids (24) (euglenozoans), as well
as ciliates (9) and apicomplexans (8) (alveolates). These structures have revealed a broad
diversity in subunit composition and higher-order organization of CV, correlated with
differences in the organisms’ IMM morphology (7, 8, 23, 25). The diversity in CV and the IMM
morphology across eukaryotic clades suggests that the limited diversity seen in CI, CIII2, and
CIV may be attributed to the limited model eukaryotes in which they have been
biochemically and structurally investigated.

Early functional characterization of the model ciliate Tetrahymena thermophila led to the
conclusion that its mETC lacked CIV and contained a distinct terminal oxidase instead (26).
However, the identification of T. thermophila homologs of CIV core subunits COX1 and
COX2 (but not COX3) in the mitochondrial genome (27) and the discovery that T. thermophila
possesses an acidic rather than a basic cyt c (28) indicated that although T. thermophila
does possess a CIV homolog, the complex has functionally diverged. Similarly, some but not
all of the core subunits of CI and CIII2 were identified in the T. thermophila mitochondrial
genome (27) and proteome (29). This is in line with proteomic studies of CIII2 and CIV
composition in alveolates Toxoplasma gondii and Plasmodium falciparum, which show a
substantial compositional divergence of these respiratory complexes relative to organisms with
known structures (30, 31). Furthermore, unlike the disc-shaped cristae of opisthokonts
and archaeplastidans, the cristae of the ciliates and other alveolates are tubular (7). Given these
functional, sequence, compositional, and morphological differences from previously studied
organisms, T. thermophila is an ideal system to examine the diversity, evolution, and conserved
aspects of mETC complexes.


The T. thermophila mETC requires a divergent cyt c


To examine the functional consequences of the acidic versus basic charge flip of T. thermophila’s
cyt c (Tt-cyt c), we characterized isolated mitochondrial membranes from T. thermophila,
mammals (Sus scrofa, pig), and plants (Vigna radiata, mung bean) using both mammalian
cyt c purified from horse heart (Equus ferus, equine, Eq-cyt c) and heterologously expressed
Tt-cyt c (fig. S1, A to E) (32). Spectroscopic activity assays for SC I+III2 showed NADH-cyt c
oxidoreduction using Eq-cyt c for all three species (fig. S1, F to J). However, CIV
activity assays showed that whereas mammalian and plant mitochondrial membranes
readily oxidize reduced Eq-cyt c, no cyt c oxidation could be detected from T. thermophila
mitochondrial membranes (Fig. 1A). Conversely, using the recombinant Tt-cyt c, only
the T. thermophila mitochondrial membranes showed cyanide-sensitive cyt c oxidation
activity (Fig. 1A). These data establish that T. thermophila possesses a CIV that has
functionally diverged from those of opisthokonts and archaeplastidans (fig. S1, K to M).


High-resolution structures of the respiratory chain from a single mixed sample


To better understand the functional divergence of the T. thermophila mETC complexes,
we extracted them from mitochondrial membranes using digitonin, stabilized them in
amphipathic polymer A8-35, and enriched them using sucrose gradient ultracentrifugation and
size exclusion chromatography (fig. S2). We then obtained the structures of Tt-CI, Tt-CIII2,
and Tt-CIV2 from a single mixed sample by single-particle cryo-electron microscopy
(cryo-EM) (Fig. 1, B to D, figs. S3 to S6, and tables S4 to S7). Our dataset also allowed us to
obtain a high-resolution reconstruction of Tt-CV2 (fig. S5 and supplementary text), whose
structure has been previously characterized (9). Thus, the current study reconstructed an
entire metabolic pathway (NADH to ATP) involving 277 proteins in three multi-MDa complexes
from a single sample (Fig. 1B).

Fig. 1. Functional and structural divergence of T. thermophila’s electron transport chain. (A) Relative CIV
activity of isolated mitochondrial membranes from T. thermophila (ciliate symbol, red), S. scrofa (pig symbol, yellow),
and V. radiata (plant symbol, green) using cyt c from horse (Eq, horse symbol, dotted) or T. thermophila (Tt,
ciliate symbol, solid) in the presence or absence of 8 µM potassium cyanide (KCN). Data are averages ± SEM (n = 4).
(B) Representation of T. thermophila’s mitochondrial electron transport chain on a tubular crista, with the width
of the membrane determined by the observed phospholipids and hydrophobic regions of the complexes. Complexes
are shown in surface (blue, Tt-CI; green, Tt-CIII2; pink, Tt-CIV2; yellow, Tt-CV2); PDB 6YNY (9). Lipids are shown
as spheres, colored by heteroatom. Cyt c homology model is shown in surface, colored by electrostatic potential.
The arrow shows approximate binding sites for cyt c on Tt-CIII2 and Tt-CIV2. IMM, inner mitochondrial
membrane; IMS, intermembrane space; pmf, proton motive force. (C) Cryo-EM map of Tt-SC I+III2 colored
by individual subunits. (D) Cryo-EM map of Tt-CIV2 colored by individual subunits.

Tt-CI and Tt-CIII2 form a supercomplex (Tt-SC I+III2) (Fig. 1C and movie S1) while Tt-CIV
forms a dimer (Tt-CIV2) (Fig. 1D and movie S2), resolved to nominal resolutions of 2.6 Å
and 3.0 Å, respectively (figs. S3 to S6). The subunit composition of the complexes was
determined from the cryo-EM density maps (33) and confirmed by mass spectrometry analysis
(tables S5 to S7). Using this approach, we were able to structurally identify 70 proteins
previously annotated as putative or uncharacterized in the sequence databases. Tt-SC I+III2
is composed of 91 protein subunits and has an overall mass of ~2.3 MDa (Fig. 1C and tables S5
and S6), ~900 kDa larger than the mammalian SC I+III2 (67 subunits, 1.4 MDa) (34). The
position and relative orientation of CI and CIII2 in Tt-SC I+III2 are analogous to the conserved
orientation of other eukaryotes (fig. S7A) (2), suggesting either a common SC ancestor or
convergent evolution across Opisthokonta, Archaeplastida, and Alveolata. In contrast to
mammalian, yeast, and plant CIV with 10 to 14 subunits (table S3) (4, 20, 35), Tt-CIV2 has
an overall mass of ~2.7 MDa with more than 52 subunits per protomer (Fig. 1D and tables
S3 and S7).
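
The size comparison quoted above can be checked with a few lines of arithmetic. The sketch below only restates the numbers given in the text; the dictionary layout and the function name `size_difference` are illustrative choices, not part of the study.

```python
# Subunit counts and approximate masses as quoted in the text.
SUPERCOMPLEXES = {
    "Tt-SC I+III2": {"subunits": 91, "mass_mda": 2.3},
    "mammalian SC I+III2": {"subunits": 67, "mass_mda": 1.4},
}


def size_difference(larger, smaller, table=SUPERCOMPLEXES):
    """Return (extra subunits, extra mass in kDa) of `larger` over `smaller`."""
    a, b = table[larger], table[smaller]
    extra_subunits = a["subunits"] - b["subunits"]
    # 1 MDa = 1000 kDa; round to absorb floating-point noise.
    extra_mass_kda = round((a["mass_mda"] - b["mass_mda"]) * 1000)
    return extra_subunits, extra_mass_kda


if __name__ == "__main__":
    subunits, mass_kda = size_difference("Tt-SC I+III2", "mammalian SC I+III2")
    print(f"{subunits} additional subunits, ~{mass_kda} kDa additional mass")
```

The masses are rounded values, so the computed difference should be read as approximate, in keeping with the "~900 kDa" stated in the text.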

Unlike the planar architecture of previously described supercomplexes (fig. S7B)
(34), Tt-SC I+III2 and Tt-CIV2 are curved assemblies (Fig. 1B), as demonstrated by the
orientation of individual complexes (fig. S7, B to D) and the modeled lipids (Fig. 1B). When
Tt-SC I+III2, Tt-CIV2, and Tt-CV2 structures were fit into a circular membrane corresponding
to the known diameter of the tubular ciliate cristae (~40 nm) (7), the modeled lipids (Fig.
1B) and hydrophobic regions of the proteins (fig. S7, B to D) matched the predicted curvature
of a ~3-nm bilayer. The mETC complexes thus contribute to the tubular morphology of
the ciliate cristae via specific structural features that are not conserved in opisthokonts
or archaeplastidans.


Tt-CI has a fragmented core and 51 accessory subunits


Unlike all previous structurally characterized CIs, three core membrane subunits of Tt-CI
responsible for Q binding and H+ pumping (ND1, ND2, and ND5) are each split into two
subunits, making a total of 17 core subunits across both CI arms (Fig. 2A). Comparison of
the structures of these split subunits with the core subunits of other CIs indicates that
subunits ND1A and ND1B, ND2A (formerly YMF65) and ND2B (formerly ND2), and ND5A and
ND5B (formerly YMF57) come together to form structural elements equivalent to ND1,
ND2, and ND5, respectively (fig. S8A and supplementary text) (27). Our structure also
confirmed the previous speculative assignments of YMF62 and YMF58 as core subunits ND6
and ND4L, respectively (table S5) (27). Notwithstanding these differences, the relative
locations of the flavin mononucleotide (FMN) cofactor and eight iron-sulfur clusters of the
peripheral arm are highly conserved (fig. S8B). In addition, the hydrophilic axis along the
center of the membrane is maintained, although it now involves residues of the additional
subunits ND1B and ND2B (Fig. 2B). Furthermore, the positions of key Q reduction site loops
(NDUS7, NDUS2 β1-β2, ND1A TMH5-6, ND3 TMH1-2, ND6 TMH3-4) are well resolved (Fig. 2C)
and consistent with positions of the equivalent loops in the mammalian “closed” state
structure (fig. S9) (3).
Among the 51 accessory subunits present in Tt-CI (Fig. 2D), 26 have homologs in all other
known eukaryotic CIs, five subunits have homologs outside of the alveolates, and 20 are
newly identified here as CI subunits (fig. S10, tables S1 and S5, and supplementary text). The
conservation of subunits across Opisthokonta, Archaeplastida, and Alveolata defines a minimal
set of 31 accessory subunits present in CI of the last eukaryotic common ancestor (LECA)
(fig. S10 and table S1). As seen in the archaeplastidan structures (14-16), Tt-CI contains a
γ-carbonic anhydrase (γCA) domain composed of three γCA subunits (Fig. 2E). Unlike vascular
plants (14, 16) but like the nonphotosynthetic alga Polytomella (15), none of the Tt-γCA
subunits (NDUCA1 to 3) retain their catalytic Zn2+-coordinating residues (fig. S11, A and B).
The ferredoxin bridge domain connecting the peripheral and membrane arms in plant and
algal CI (15) is also present in Tt-CI (Fig. 2F). In addition to the T. thermophila homologs of
the ferredoxin-like subunit (NDUFX), the acyl carrier protein subunit (NDUAB1-α), and
accessory subunit NDUA6, the bridge in Tt-CI is further stabilized by a J-domain subunit
(coined NDUJ1) and the extended C terminus of NDUCA3 (Fig. 2F). Whereas the plant and
algal CI-FDX has lost its ability to coordinate an iron-sulfur cofactor (15), we observe clear
density for a 2Fe2S cluster in T. thermophila’s NDUFX (figs. S7E and S11, C and D), indicating
that it may be a functional redox cofactor.
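
The subunit counts in this section are internally consistent, which the following sketch makes explicit. It uses only the numbers stated above; the variable and function names are illustrative, and reading the 31-subunit LECA set as the sum of the two conserved categories is an interpretation of the text rather than a statement taken from it.

```python
# Counts quoted in the text for Tt-CI.
CANONICAL_CORE_SUBUNITS = 14               # core subunits in other CIs
SPLIT_CORE_SUBUNITS = ("ND1", "ND2", "ND5")  # each split into A and B halves

ACCESSORY_CATEGORIES = {
    "homologs in all other known eukaryotic CIs": 26,
    "homologs outside of the alveolates": 5,
    "newly identified as CI subunits": 20,
}


def core_subunit_count(canonical=CANONICAL_CORE_SUBUNITS,
                       split=SPLIT_CORE_SUBUNITS):
    """Each split subunit contributes one extra polypeptide."""
    return canonical + len(split)


def accessory_subunit_count(categories=ACCESSORY_CATEGORIES):
    """Total accessory subunits across the three categories."""
    return sum(categories.values())


def conserved_accessory_count(categories=ACCESSORY_CATEGORIES):
    """Assumed reading: subunits with homologs beyond the alveolates."""
    return (categories["homologs in all other known eukaryotic CIs"]
            + categories["homologs outside of the alveolates"])


if __name__ == "__main__":
    print(core_subunit_count())         # core subunits in Tt-CI
    print(accessory_subunit_count())    # accessory subunits in Tt-CI
    print(conserved_accessory_count())  # compare with the stated LECA set
```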


Tt-CI lacks an A/D transition and has crowded Q-site loops


Structural analyses of yeast and mammalian CI have revealed that the complex adopts
distinct “open” and “closed” conformational states characterized by a global change in the angle
between the two arms, as well as order-to-disorder rearrangements of the Q-reduction
site loops (e.g., the ND3 TMH1-2 loop, Fig. 3A) and an α-helix to π-bulge conversion in ND6
TMH3 (fig. S9) (13, 34, 36, 37). These conformational changes have been proposed to be part
of CI’s coupling mechanism between Q reduction and H+ pumping (13, 38). However,
yeast and mammalian CIs are also capable of undergoing an “active-to-deactive” (A/D)
transition, whereby CI enters an off-pathway deactive (D) state that is incapable of catalytic
turnover (39). Therefore, it has been controversial whether the structural rearrangements
of the open and closed states (fig. S9) are central to CI turnover or represent A and D states
(13, 40). The A/D transition is not a universal feature of CI, as several bacterial and
invertebrate CIs lack a D state (39). Structural characterization of a CI lacking the A/D transition
would address this controversy: If these structural rearrangements were required for CI’s
catalytic mechanism, they should be possible even in CIs that lack an A/D transition.

Upon entering the D state (for example, by incubating the mitochondrial membranes at
37°C in the absence of substrate), a conserved cysteine residue in CI’s ND3 TMH1-2 loop
(a cysteine of ND3 in T. thermophila; Fig. 3, A and B) becomes accessible to modification by
N-ethylmaleimide (NEM) (41). NEM modification of this cysteine traps CI in the D state
and reduces overall activity (fig. S12). The detrimental effect of NEM on mammalian CI
activity can be prevented by pre-activating the complex with a small amount of NADH
substrate (Fig. 3C). This outcome can be understood structurally: When mammalian CI is in
the “closed” state, the ND3 TMH1-2 loop is ordered and the cysteine residue (Cys39 of ND3
in O. aries) is buried in the structure, whereas when CI is in the “open” state, the ND3 TMH1-2
loop is disordered and the cysteine is accessible to NEM (Fig. 3B). A similar effect can be seen
with the addition of Mg2+, which is known to inhibit the A/D transition in mammals and
yeast (Fig. 3D) (39). When treated identically to the mammalian sample to induce entry
into the D state (i.e., incubation without substrate at 37°C for 10 to 20 min), the CI activity
of T. thermophila’s mitochondrial membranes was insensitive to NEM or Mg2+ (Fig. 3, E
and F). These data indicate that, like bacteria and invertebrates (39), Tt-CI does not possess
an A/D transition. Furthermore, in Tt-CI, the presence of additional T. thermophila-specific
subunits (NDUJ1, NDUTX, and NDUCA3) and of extensions in NDUA6 and NDUA9 leads to
crowding of the region surrounding the ND3 TMH1-2 loop (Fig. 3G). As a result of this
crowding, any conformational changes of the ND3 TMH1-2 loop during the catalytic cycle
would require rearrangements of the surrounding structure. For example, the observed
position of Tt-CI’s ND3 TMH1-2 loop conflicts with the “up” position of the ND1 TMH5-6
loop seen in the O. aries complex I bound to NADH (Fig. 3H) (12). Thus, if the “up” position
of the loop is a conserved on-pathway state, the ND3 TMH1-2 loop and the surrounding
subunits must undergo large (>5 Å) displacements during turnover. Disulfide cross-linking
of the ND3 TMH1-2 loop has been shown to uncouple Q reduction and H+ pumping
in yeast CI (Y. lipolytica) (42), indicating that conformational flexibility in this region
is essential for redox-driven H+ pumping. Given the crowded environment of this loop in
Tt-CI, explicit disordering of this loop during turnover is unlikely. Thus, Tt-CI is an important
model system in which to study these coordinated conformational changes during
the CI catalytic cycle.

Fig. 2. Structural and functional features of Tt-CI. (A) Tt-CI’s 17 core subunits in colored surfaces; accessory
subunits are shown as gray cartoons. Core subunits ND1B, ND2B, and ND5B are highlighted with black borders.
(B) Hydrophilic axis along the membrane arm of Tt-CI. Core subunits that form the membrane arm are shown as
cylinders, colored as in (A). Core subunits ND1B and ND2B are highlighted with black borders. Conserved
charged residues along the hydrophilic axis are shown as sticks, colored by subunit. (C) Q-site loops are shown as cartoons
embedded in the density, colored by subunit. Terminal iron-sulfur cluster N2 is shown as spheres, residues as sticks.
(D) The 51 accessory subunits are shown in colored surfaces, core subunits as gray transparent cartoons.
(E) Tt-CI’s three γ-carbonic anhydrase subunits are shown as colored cartoons on the Tt-CI surface (blue). (F) Tt-CI’s
ferredoxin bridge shown as colored cartoons on the Tt-CI surface (blue).

Fig. 3. Structural and functional comparison of complex I’s active-to-deactive
(A/D) transition. (A and B) Comparison of complex I’s ND3 TMH1-2 loop in
T. thermophila (A) and O. aries (B) in the ordered (“closed,” PDB 6ZKO)
and disordered (“open,” PDB 6ZKS) conformations. (C to F) Functional
characterization of A/D in isolated mitochondrial membranes of S. scrofa [(C) and
(D)] and T. thermophila [(E) and (F)] by spectroscopic measurement of NADH
dehydrogenase activity at 340 nm in the presence of the indicated concentrations
of N-ethylmaleimide (NEM) [(C) and (E)] or MgCl2 [(D) and (F)] with preincubation
with 5 µM NADH or water. Data are averages ± SEM; n = 3 or 4. (G) Crowding
around Tt-CI’s ND3 TMH1-2 loop. ND3 is shown in blue surface. Secondary structure
elements of various accessory subunits that pack against the ND3 loop are shown
as colored cartoons. Accessory subunits are colored as in Fig. 2. (H) Superposition
of T. thermophila ND1 (light purple cartoon) with O. aries ND1 in NADH-bound,
open-state CI (“up” conformation, red cartoon, PDB 6ZKH). O. aries ND1 up
conformation would clash with ND3 TMH1-2 loop (blue surface) and NDUS2
(yellow surface). The clash is marked with a red star.
[Fig. 3, C to F, y-axis: CI specific activity (nmol/min*mg).]

Although the angle between Tt-CI’s membrane and the peripheral arm is more similar
to that in some “open” states (fig. S9, E to H), given Tt-CI’s well-ordered Q-reduction site loops
(Fig. 2C) and the lack of a π-bulge in ND6 TMH3 (fig. S9D), our structure corresponds
most closely with the “closed” state of mammalian and yeast CIs. This confirms that the
“closed” state is an active state of the enzyme (12, 37, 38). Unlike the CI structures from
opisthokonts and archaeplastidans (12, 13, 15, 34), resting Tt-CI does not contain structural classes
with distinct angles between the membrane and peripheral arms (i.e., open and closed
states). Given the multiple additional subunits that bridge the two arms of Tt-CI (Fig. 2,
E and F), large conformational changes between the peripheral and membrane arms
are unlikely. Therefore, we can conclude that both the ND6 TMH3 π-bulge and the open
state of CI with disordered Q-site loops are features of the D state of the enzyme and
are not related to the conserved coupling mechanism.


Tt-SC I+III2 breaks the symmetry of Tt-CIII2


Although the overall arrangement of Tt-SC I+III2 is like that of the mammalian supercomplex
(fig. S7A) (34), the interactions between Tt-CI and Tt-CIII2 are more extensive. Whereas
mammals show one interaction site in the matrix and one in the IMM involving six
subunits (3), Tt-SC I+III2 possesses three matrix bridges (sites 1 to 3), IMM interactions (site 4),
and an IMS bridge (site 5; Fig. 4A) involving 10 Tt-CIII2 subunits and 11 Tt-CI subunits
(fig. S13 and supplementary text). Two of T. thermophila’s matrix sites, the “thioredoxin/J
bridge” and the “toe bridge,” are formed by Tt-CI subunits not seen in other organisms
(Fig. 4, B and C). The additional bridges in the matrix wedge between Tt-CI and Tt-CIII2 and
push their matrix domains apart. Concomitantly, novel interactions in the IMS, including
a shared Zn2+ coordination between NDUA11 and UQCRH (fig. S13I), pull the complexes
together. Overall, these interactions splay Tt-SC I+III2 in line with the IMM curvature (Fig.
1B and fig. S7B). Furthermore, ordered lipids bridge the Tt-CI and Tt-CIII2 interface at the
matrix and IMS leaflets of the membrane (Fig. 4, D and E).

Although divergent in sequence, each Tt-CIII protomer is composed of the 10 known CIII
subunits seen in other eukaryotic species (Fig. 4, F and G) and two unidentified short chains
(figs. S14 and S15 and supplementary text). CIII2 is a symmetric dimer, but the structures
of mammalian supercomplexes provided initial indications that CIII2 may be
symmetry-broken in higher-order assemblies (34, 43, 44). The structure of Tt-CIII2 displays extensive
structural and functional symmetry breaking caused by its interaction with Tt-CI, including
differential flexibility or occupancy of several subunits (Fig. 4H). The UQCRFS1 head
domain adjacent to the CI Q reduction site is highly flexible and therefore lacks clear
cryo-EM density, whereas the head domain proximal to the “toe bridge” (Fig. 4B) is restricted
by its proximity to Tt-CI’s NDUB8, leading to clear density (Fig. 4, H and I). Given that the
swinging motion of the UQCRFS1 head domain is essential for CIII2’s electron transfer from
QH2 to cyt c1 (45), we conclude that the UQCRFS1 proximal to the toe bridge is likely not
functional. Thus, only the Tt-CIII protomer adjacent to the source of QH2 on CI is capable
of QH2 oxidation. The symmetry breaking of Tt-CIII2 is further demonstrated by the widening
of the distance between the two catalytic bL hemes (Fig. 4, J and K). In bacteria, yeast,
plants, and mammals, the CIII2 dimer arranges the bL hemes within direct
electron-transfer distance (~11 Å) (20, 46); a similar distance is also conserved in the distantly
related cytochrome b6f complex of the photosynthetic ETC (Fig. 4J) (47). Direct electron
transfer between the bL hemes is known to occur and is thought to play a role in the
catalytic mechanism of CIII2 (48). In contrast, in T. thermophila the supercomplex interactions
(Fig. 4A) and the membrane curvature (Fig. 1B) widen the distance and change the relative
tilt between the two Tt-CIII protomers, resulting in an increased gap between the bL hemes,
placing them beyond rapid electron-transfer distance (Fig. 4J).

The symmetry breaking of the Rieske head
domains and the increased distance between
the bL hemes would result in the inability of
the Tt-CIII COB subunit distal to the Tt-CI Q
site to accept electrons, leading to the functional
specialization of the two Q cavities (Fig.
4L). In a symmetric CIII2, each Q binding cavity
contains a QH2-oxidizing site (QP) near the
IMS and a Q-reducing site (QN) near the
matrix, each provided by the opposite COB
protomer (34). Direct electron transfer across
protomers means that there are four possible
electron pathways between the QP and QN
sites during the Q cycle (49). The symmetry
breaking of the Rieske head domains and
increased bL-heme distance in the Tt-SC I+III2
structure remove three possible pathways. The
only possible electron transfer path is from the
QP site of the Q cavity adjacent to Tt-CI’s Q
tunnel (the source of QH2) to the QN site of the
same COB subunit in the opposite Q cavity
(Fig. 4L). This leads to the specialization of the
Q cavities in the Q cycle: The one adjacent to
the source of QH2 would be responsible for
QH2 oxidation, and the other one for Q reduction
(Fig. 4L). Although this functional
symmetry breaking has been previously proposed
for mammalian SC I+III2 (3, 34), the
unprecedented asymmetry in Tt-SC I+III2 reveals
how the CIII2 electron transfer pathways
are limited within the supercomplex, validating
previous predictions and confirming
a key physiological function of supercomplex
formation. Furthermore, the large number of
contacts between Tt-CI and Tt-CIII2, the adaptation
of the supercomplex to the high membrane
curvature, and the lack of biochemical
detection of individual Tt-CI on BN-PAGE gels
(fig. S2E) all suggest that Tt-SC I+III2 is the
minimal organization of Tt-CI (i.e., no free
Tt-CI is present in T. thermophila mitochondrial
membranes). Although obligatory supercomplexes
have been seen between CIII2 and
CIV in actinobacteria (50), none have been
previously observed in eukaryotes.
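
The pathway counting in the Q-cycle argument above can be sketched as a small enumeration. This is only an illustrative bookkeeping model, not part of the structural analysis: the labels "proximal" and "distal" (relative to the Tt-CI Q tunnel), the function name, and the two boolean-style inputs are assumptions introduced here. A path is written as (COB protomer providing the QP site, COB protomer providing the QN site).

```python
from itertools import product

# Hypothetical labels: protomer position relative to the Tt-CI Q tunnel.
PROTOMERS = ("proximal", "distal")


def q_cycle_paths(functional_rieske, inter_bl_transfer):
    """Enumerate QP -> QN electron paths through a CIII2 dimer.

    Each path starts at the QP site served by one COB protomer and ends at
    the QN site of the same protomer or, via bL-to-bL transfer, of the other.
    """
    paths = []
    for start, end in product(PROTOMERS, repeat=2):
        if start not in functional_rieske:
            # QH2 oxidation needs a mobile Rieske head domain at this QP site.
            continue
        if start != end and not inter_bl_transfer:
            # Crossing protomers requires the bL hemes to be within range.
            continue
        paths.append((start, end))
    return paths


# Symmetric dimer: both Rieske heads mobile, bL hemes within transfer distance.
symmetric = q_cycle_paths({"proximal", "distal"}, inter_bl_transfer=True)

# Symmetry-broken case described in the text: one locked head, widened bL gap.
symmetry_broken = q_cycle_paths({"proximal"}, inter_bl_transfer=False)

print(len(symmetric), len(symmetry_broken))
print(symmetry_broken)
```

Under these assumptions the symmetric dimer yields four paths and the symmetry-broken dimer retains one, which matches the text's statement that three pathways are removed.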


Tt-CIV2 is a 2.7-MDa dimer with a deep, 
charge-swapped cytochrome c crater 


Tt-CIV2 is unlike any cytochrome c oxidase of
known structure (fig. S16A). With its 2.7-MDa
size (1.35 MDa per protomer), it surpasses CI 
as the largest complex of the mETC (Fig. 1D). 
Its subunits include the three universally 
conserved core catalytic subunits, six acces- 
sory subunits found in other eukaryotes, and 
44 accessory subunits not seen in other CIV 
structures (fig. S17, table S3, and supplementary
text). Although the transmembrane domains
and catalytic cofactors are conserved
(fig. S18), all three Tt-CIV core subunits (COX1 
to 3) contain large extensions and insertions 
(Fig. 5A). The sequence and structural diver- 
gence of COX3 (formerly YMF68) is such that 
previous identification by homology was not 
possible (27). In the IMS, the core subunits’ 
extensions form the cyt c binding “crater” 
(Fig. 5B) together with several accessory subunits
(fig. S19A). Analogous to Tt-cyt c (fig. S1,
A and B), the Tt-CIV crater shows a flipped 
surface electrostatic potential relative to bac- 
teria, opisthokonts, and plants (Fig. 5, B to D, 
and supplementary text). This “swapped” 
charge property between Tt-CIV2 and Tt-cyt c
(fig. S20) explains why electron transfer cannot
happen between mammalian cyt c and
Tt-CIV2 but can happen between mammalian
cyt c and plant CIV (Fig. 1A).

The proton conductance pathways (“K-, D-,
and H-channels”) (51) that provide the chemical
and pumped protons to CIV also show
differences in T. thermophila. Although a 
critical proton acceptor in the D-channel is 
conserved (Glu®”? of COX1), a key threonine 
of the K-channel important for turnover in 
bacterial, archaeplastidan, and opisthokont 
CIV (Thr*!° in yeast) (52) is replaced by a 
nonprotonatable alanine in Tt-COX1 (Ala*®*) 
(fig. S19B). Thus, the Tt-CIV2 K-channel either
follows a different path or is not operational. 
Similarly, a key proton acceptor (Asp” of 
COX1 in bovine CIV) (53) and a conserved 
cation (18) are missing in Tt-CIV2’s H-channel
(fig. S19B), suggesting a function as a dielectric
regulator rather than as a proton channel
in Tt-CIV2 (51). Other key residues for O2 access
to the binuclear center and electron transfer
from cyt c are conserved (fig. S19, C and D,
and supplementary text). 


Tt-CIV2 incorporates mitochondrial carriers 
and a small TIM-like hexamer 


Most of Tt-CIV’s 44 accessory subunits re- 
mained uncharacterized or putative in the 
T. thermophila genome (fig. S17E) (29), with 
only 13 showing recognizable folds (fig. S17C). 
These subunits include three mitochondrial 
carriers (COXMC1 to 3; Fig. 5E), a hexameric 
Fig. 4. Tt-SC I+III2 interactions and Tt-CIII2 symmetry breaking. (A) Contact
sites between Tt-CI and Tt-CIII2 in Tt-SC I+III2, color-coded and shown in
open-book configuration (orange, site 1; yellow, site 2; blue, site 3; red, site 4;
green, site 5). (B and C) T. thermophila-specific Tt-SC I+III2 matrix bridges
(NDUJI/TX bridge and toe bridge) in the context of Tt-SC I+III2, viewed from the
matrix (B) or from the plane of the membrane (C). Key subunits are shown
as colored cartoons and highlighted in black outline. (D and E) Modeled lipids
(spheres) at the interface between Tt-CI (blue surface) and Tt-CIII2 (green
surface). Lipids at the matrix (D) or IMS (E) leaflet of the membrane are viewed
from the matrix or IMS, respectively. (F and G) Structure of Tt-CIII2, with core
as transparent cartoon. Unidentified chains (polyalanine model) are in red.
(H) Broken symmetry in the Tt-CIII2 dimer. For clarity, maps were carved around
the asymmetric subunits. For each protomer, density is shown in transparent
gray, models as cartoons. The subunit labels indicate differences in each
protomer. (I) UQCRFS1 head domain (magenta surface) constrained at the
interface between Tt-CI and Tt-CIII2. Iron-sulfur (FeS) and heme cofactors
(sticks) are shown. (J and K) Comparison of b-hemes (J) and c-hemes (K) in
respiratory CIII2 and photosynthetic b6f complex dimers across cyanobacteria
[PDB 1VF5 Mastigocladus laminosus b6f complex (63)] in gray, bacteria [PDB
2YIU P. denitrificans (64)] in pink, yeast [PDB 6HU9 S. cerevisiae (18)] in blue,
plants [PDB 7JRG V. radiata (20)] in green, mammals [PDB 1BGY B. taurus
(46)] in yellow, and T. thermophila (orange). Hemes were superposed by one
Tt-COB subunit in the dimer. Distances between hemes were calculated edge-to-edge
of the macrocyclic conjugated system. For clarity, only distances for
T. thermophila and B. taurus are shown. (L) Schematic of the symmetry-broken
Q cycle and Q cavity specialization of Tt-CIII2. Tt-CIII protomers are shown in
dark green surface (proximal to Tt-CI Q tunnel, blue oval) or light green surface
(distal to Tt-CI Q tunnel). The functional QN and QP sites are indicated with
dark green ovals, corresponding to the dark green protomer. The functional


Fig. 5. Structural features of Tt-CIV2. (A) Structure
of Tt-CIV2, with core subunits in surface
over silhouette of entire complex, viewed from
the membrane (left) or IMS (right). (B) Surface
electrostatic potential of Tt-CIV2. Cyt c crater
is marked with a black circle. (C and D) Charge
reversal in CIV-cyt c binding between mammals and
T. thermophila. CIVs were aligned by COX2. Key
residues are shown as sticks. (C) Binding of bovine
COX2 (red cylinder, cow symbol) to equine cyt c
(yellow cylinder, horse symbol) (PDB 5IY5)
(74). (D) T. thermophila COX2 (gray cylinder). (E to
G) Notable Tt-CIV2 accessory subunits viewed
from the IMS. (E) Mitochondrial carrier subunits
COXMC1 to COXMC3. (F) Other subunits with
annotated folds. (G) Mitochondrially encoded subunits
YMF67, YMF70, and YMF75. (H) Side view
of COXMC1-COXMC2 dimer (pink and green
cylinders) with interfacial lipids (sticks). Key
substrate-contacting residues, as well as conserved
prolines of PX[D/E]XX[R/K] motif on odd-numbered
helices, are shown as sticks. (I) Side view of
COXMC3 (cyan cylinders) and the constraining
amphipathic helix of COX3. (J) TIM83-TIM133-like
heterohexamer domain with disulfide bonds in
yellow. The Tt-specific loop of COX2 wraps around
five of the COXTIM subunits and inserts itself into
the central cavity of the hexamer. Amino acid
abbreviations: D, Asp; E, Glu; K, Lys; P, Pro; R,
Arg; Y, Tyr.



UQCRFS1 head domain that is capable of a flexible swinging motion (double-headed
gray arrow) is shown in dark green surface, with its FeS cluster as
spheres. Both bL and c1 positions of the head domain are shown. Tt-CIII2’s
functional hemes are shown as heteroatom-colored spheres, nonfunctional
hemes as magenta spheres. Red crosses indicate inability of electron transfer.
Tt-CI’s N2 FeS cluster is shown as spheres. Homology model of Tt-cyt c is
shown in surface colored by electrostatic potential. Approximate position of inner
mitochondrial membrane is marked with black lines. bH, heme bH; bL, heme
bL; c1, heme c1; e−, electron; H+, proton; Q, oxidized quinone; QH2, reduced
quinone (quinol); Qs, semiquinone.




TIM83-TIM133 chaperone-like domain (COXTIM1
to 6) in the IMS, a β-propeller domain protein
(COXBP), an acyl carrier protein (COXAC),
and a [2Fe2S]-MitoNEET-like domain protein
(COXFS; Fig. 5F) with two FeS cofactors
in the matrix (fig. S21 and supplementary
text). Tt-CIV2 also contains three mitochondrially
encoded subunits (YMF67, YMF70, and
YMF75) that do not have homologs in any
other known CIV structures (Fig. 5G), including
the ancestral α-proteobacterial CIV (54),
indicating that the ciliate mitochondrial genome
has gained new genes. Additionally,
there are several posttranslational modifications
in accessory subunits visible in the cryo-EM
maps (fig. S18 and supplementary text).
Mitochondrial carriers, or solute carrier fam- 
ily 25 (SLC25), are essential proteins that carry 
solutes into and out of the mitochondrial 
matrix (55). Although direct interaction between 
carriers and respiratory complexes has been 
proposed (56), this has remained controversial.
The structure of Tt-CIV2 revealed a mitochondrial
carrier dimer (COXMC1-COXMC2)
adjacent to COX1 (Fig. 5, E and H) and a
monomeric mitochondrial carrier (COXMC3) 
adjacent to COX3 (Fig. 5, E and I, and sup- 
plementary text). 

Although both COXMC1 and COXMC2 are 
annotated as 2-oxoglutarate carriers (SLC25A11), 
key dicarboxylate-contacting residues (57) are 
fully conserved only for COXMC2 (Fig. 5H). 
The COXMC1-COXMC2 dimer is attached to
the rest of Tt-CIV2 via interactions with COX1’s
N-terminal helical extension as well as with
subunit COX7C (fig. S21A). COXMC1’s interactions
with COX7C and COXTT22 block accessibility
from the IMS (fig. S21A), making it
unlikely that COXMC1 functions as a transporter.
Instead, COXMC1 may have been co-opted
by Tt-CIV2 as a structural or regulatory
subunit. The COXMC1 N-terminal loop (resi- 
dues 1 to 8 of COXMC2) inserts deeply into 
the COXMC2 central cavity (Fig. 5H). Glu* 
of COXMC1 is in close proximity to several of 
COXMC2’s dicarboxylate-binding residues (Fig. 
5H). Given the similar chemical structures of 
oxaloacetate and glutamate, Glu* of COXMC1 
may act as a substrate analog, placing COXMC2 
in an inhibited state. 

The monomeric mitochondrial carrier subunit
COXMC3 has long amphipathic α-helical
extensions on both termini (Fig. 5I). These
extensions interact with core subunit COX3
via bridging lipids in the IMS (fig. S21B). The
N-terminal helix of COX3 also interacts with
COXMC3 in the matrix (Fig. 5I). These interactions,
as well as the broken symmetry of the
carrier’s TMHs (55) (Fig. 5I), make COXMC3
unlikely to function as a solute carrier. Rather,
given its interactions on either side of the
membrane and its peripheral location (fig.
S21C), it is likely that COXMC3 has been co-opted
to help bend T. thermophila’s cristae.

Tt-CIV2 also contains a hexameric α-propeller
domain composed of small TIM8-like subunits
next to the cyt c crater (Fig. 5, F and J). Soluble
TIM93-TIM103 (58) or TIM83-TIM133 (59) hexamers
function as IMS chaperones during
mitochondrial protein import (fig. S22). Moreover,
a TIM93-TIM102-TIM12 hexamer forms
part of the TIM22 complex (60) that inserts
mitochondrial carriers into the IMM (fig. S22).
In contrast to canonical soluble TIM hexamers,
the Tt-CIV2 TIM assembly is a heterohexamer
of six distinct subunits (COXTIM1
to 6) (Fig. 5J). The TIM hexamer is anchored
to Tt-CIV2 via interaction with a Tt-specific
loop of COX2. As seen in structures of TIM
hexamers binding to substrate peptides (61),
this Tt-COX2 loop embeds itself in the interhelical
clefts of the COXTIM subunits (Fig. 5J).
Additionally, a short Tt-COX2 helix inserts itself
into the central channel of the Tt-TIM
hexamer, parallel to the hexamer’s pseudosymmetry
axis (Fig. 5J). Human TIM8a has
been implicated in CIV maturation through
interactions with COX6B and assembly factors
COX17 and COX19, which deliver essential
copper atoms to COX2 (62). In T. thermophila,
a group of subunits, including homologs of
COX6B and COX17, are found adjacent to the
TIM hexamer of Tt-CIV2 (fig. S19A). These
subunits may be part of Tt-CIV2’s maturation
pathway that stay bound to the final
complex, which suggests that one pathway to
subunit addition into mETC complexes may
be via the permanent incorporation of assembly
factors.


Conclusions 


The structures of Tt-SC I+III2 and Tt-CIV2 reveal
the diversity present in eukaryotic core
metabolism. Collecting cryo-EM data on a 
partially purified sample allowed for the si- 
multaneous structure determination of an entire 
metabolic pathway. This work demonstrates 
that ciliate mETC complexes have diverged 
from those of opisthokonts and archaeplasti- 
dans, including incorporating additional ac- 
cessory subunits, flipping surface charges, 
splitting core subunits, and enhancing in- 
tercomplex interactions to produce highly 
curved and symmetry-broken assemblies. 
Furthermore, functional differences in the 
T. thermophila mETC complexes, such as the 
lack of a deactive state in Tt-CI, allow for 
the disambiguation of previous observations 
and will contribute to a more complete under- 
standing of the conserved mechanisms of 
these essential complexes. 
FOREST ECOLOGY


The biodiversity and ecosystem service contributions 
and trade-offs of forest restoration approaches 


Fangyuan Hua, L. Adrian Bruijnzeel, Paula Meli, Philip A. Martin, Jun Zhang,
Shinichi Nakagawa, Xinran Miao, Weiyi Wang, Christopher McEvoy, Jorge Luis Peña-Arancibia,
Pedro H. S. Brancalion, Pete Smith, David P. Edwards, Andrew Balmford


Forest restoration is being scaled up globally to deliver critical ecosystem services and biodiversity 
benefits; however, there is a lack of rigorous comparison of cobenefit delivery across different 
restoration approaches. Through global synthesis, we used 25,950 matched data pairs from 264 studies 
in 53 countries to assess how delivery of climate, soil, water, and wood production services, in 
addition to biodiversity, compares across a range of tree plantations and native forests. Benefits of 
aboveground carbon storage, water provisioning, and especially soil erosion control and biodiversity are 
better delivered by native forests, with compositionally simpler, younger plantations in drier regions 
performing particularly poorly. However, plantations exhibit an advantage in wood production. These 
results underscore important trade-offs among environmental and production goals that policy-makers 
must navigate in meeting forest restoration commitments. 


As the UN Decade on Ecosystem Restoration
gets underway (1), forest restoration
on degraded and deforested land
is being scaled up globally, with far-reaching
environmental and social implications
(2-4). The Bonn Challenge, for
example, has pledged to restore 350 million
hectares of land by 2030 (5), and many other
initiatives are similarly ambitious (6, 7). Large-scale
programs to restore forests are frequently
motivated by a desire to recover ecosystem
services such as carbon storage (8), soil erosion
control (9), water provisioning (10), and
wood production (11). Based on an implicit
assumption that these services can be effec- 
tively delivered by forests regardless of their 
composition, many of these programs grav- 
itate toward reforesting with compositionally 
simple tree plantations rather than restoring 
native forests (7, 10, 12). However, this prem- 
ise has yet to be tested rigorously with paired 
data that limit potential confounding factors 
(13) (supplementary text). This omission is crit- 
ically important for reasons beyond the target 
ecosystem services per se, because a focus on 
tree plantations has limited (14), and at times
negative (9), effects on native biodiversity and
consequently risks severely limiting the con- 
servation potential of large-scale forest resto- 
ration, in turn hampering progress toward global 
commitments to halt and reverse biodiversity 
loss (15-17) and ecosystem degradation (1). 
We present a global synthesis of paired data 
from the world’s main forest biomes to assess 
the merits of forest restoration approaches, in 
particular reforesting with tree plantations 
versus restoring native forests, on deforested 
land that would have been naturally forested 
in recent history (see materials and methods) 
(18). We compare the performance of a range 


of compositionally simple tree plantations span- 
ning a wide spectrum of management regimes 
(“tree plantations” hereafter) (18) with the performance
of native forests (including restored
and preexisting native forests) in delivering 
the key ecosystem services of carbon storage, soil 
erosion control, water provisioning, and wood 
production, as well as in supporting biodiver- 
sity. We further assess how variation in the rela- 
tive performance of tree plantations versus 
native forests may be explained by plantation 
features and biophysical conditions. Our study 
aims to enable forest restoration to achieve 
cobenefits in addressing today’s multiple envi- 
ronmental challenges (4), including the dual 
climate and biodiversity crises (8, 17). By sim- 
ultaneously considering the performance of 
forests in carbon, soil, water, and biodiversity 
(i.e., environmental outcomes) in addition to
performance in wood production, our study 
also provides a critical assessment of the trade- 
offs likely to confront forest restoration deci- 
sion makers. 

For each environmental outcome, we identified
the most informative metric with a reasonable
amount of empirical data: aboveground
biomass [megagrams per hectare (Mg ha⁻¹)],
amount of eroded soil [kilograms per square
meter per year (kg m⁻² y⁻¹)], catchment- or
plot-scale water yield (percent of rainfall),
and species-specific abundance [individuals
per hectare compiled for each species in a
given ecological community; see (18) for rationale
of metric choices]. After searching
peer-reviewed and gray literature and corresponding
with authors, we compiled pairs of
data that involved a tree plantation (classified
into three types) and a matching native forest
(classified into four types; Fig. 1A) from the
same study system (18). For wood production,
we compiled pairs of empirical data on wood
yield [cubic meters per hectare (m³ ha⁻¹)] or
profit [US dollars per hectare (USD ha⁻¹)] that
involved a tree plantation and a matching
restored native forest (Fig. 1A) over equal time
horizons (18); we excluded native forests not
resulting from restoration because the sustainability
of their wood harvest could rarely be
confirmed. Given the paucity of paired wood
production data, we relaxed the matching requirement
to also compile annualized yield
data just from restored native forests [cubic
meters per hectare per year (m³ ha⁻¹ y⁻¹)]
(18), which we compared with known annualized
yields of some of the world’s main monoculture
plantations (19).

We assessed the rigor of matching for each 
data pair and weighed it accordingly in subsequent
analyses (18). We calculated a log
response ratio (RR) [ln(tree plantation over
native forest)] from each data pair to represent
the relative performance of tree plantations
versus native forests; we reversed the RR
signs for eroded soil to represent soil erosion
control. In total, our searches (18) (fig. S1 and
tables S1 to S3) yielded 25,535 RRs for species-specific
abundance on 13 species groups from
405 plantation-native forest pairs, 146 RRs for 
aboveground biomass, 82 RRs for eroded soil, 
167 RRs for water yield, and 20 RRs for wood 
production, from 264 studies in 53 countries 
(Fig. 1 and table S4). In addition, we collated 
223 records on the standing wood volume of 
restored native forests with known age from 10 
studies in six countries (fig. S2 and table S4). 
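
The response-ratio definition and the data tally in the paragraph above can be checked with a short sketch. The function name, its `reverse_sign` flag, and the example erosion values are illustrative assumptions rather than anything specified in the study; only the per-metric RR counts are taken from the text.

```python
import math


def log_response_ratio(plantation, native_forest, reverse_sign=False):
    """Return ln(plantation / native forest) for one matched data pair.

    reverse_sign=True mirrors the sign reversal applied to eroded soil, so
    that a positive value means better soil erosion control in the plantation.
    """
    rr = math.log(plantation / native_forest)
    return -rr if reverse_sign else rr


# Hypothetical pair: a plantation losing twice as much soil as a native forest.
rr_erosion_control = log_response_ratio(2.0, 1.0, reverse_sign=True)
print(rr_erosion_control)  # negative: poorer erosion control in the plantation

# Numbers of RRs per metric as reported in the text.
rr_counts = {
    "species-specific abundance": 25_535,
    "aboveground biomass": 146,
    "eroded soil": 82,
    "water yield": 167,
    "wood production": 20,
}
total_pairs = sum(rr_counts.values())
print(total_pairs)  # 25950
```

The per-metric counts sum to the 25,950 matched data pairs quoted in the abstract.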

We first asked how well tree plantations 
performed in environmental outcomes rela- 
tive to reference native forests not resulting 
from restoration, namely old growth forests 
and “generic” native forests (i.e., other non- 
restored native forests not reported as old 
growth). Not having undergone deforestation, 
these native forests represent reference environ- 


mental conditions (20) toward which forest 
restoration can aspire (Fig. 2A) (18). Consist- 
ent with prevailing understanding (14, 21), 
tree plantations supported on average 29.3% 
lower species-specific abundance than did 
reference native forests [95% confidence inter- 
val (CI): 22.0 to 35.9%; Fig. 2B, upper panel, 
and table S5; for differences among species 
groups, see fig. S3]. This biodiversity contrast
was echoed across the other three environ- 
mental metrics, with tree plantations deliver- 
ing 32.8% lower aboveground biomass (95% 
CI: 16.5 to 45.9%), 60.6% lower soil erosion 
control (16.2 to 81.4%), and 13.4% lower water 
yield (4.1 to 21.9%; Fig. 2B, upper panel; table 
S5). These patterns were mainly driven by the 
poor performance of monoculture plantations, 
which exhibited the greatest contrasts with 
reference native forests (Fig. 2B, upper panel, 
and table S5). Prolonged age (≥40 years) or
abandonment appeared to somewhat improve 
the environmental performance of plantations 
(18), with water yield shortfall no longer sig- 
nificant (mean: 6.7%; 95% CI: -23.4 to 29.4%; 
Fig. 2B, lower panel; table S5). However, dif- 
ferences for the other metrics persisted, albeit 
less marked: 14.6% (2.3 to 25.3%) for species- 
specific abundance and 24.0% (6.2 to 38.5%) 
for aboveground biomass; there were too few 
data to assess soil erosion control (Fig. 2B, 
lower panel, fig. S3, and table S5). 
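
As a hedged illustration of how percentages such as "29.3% lower" relate to the log response ratio, the sketch below assumes the usual back-transformation, percent change = (exp(RR) − 1) × 100. This excerpt does not state the transformation explicitly, so treat it as an assumption; the function names are hypothetical.

```python
import math


def percent_change(rr):
    """Back-transform a log response ratio to a percent difference."""
    return (math.exp(rr) - 1.0) * 100.0


def rr_from_percent_lower(pct_lower):
    """Inverse of percent_change for a value reported as 'X% lower'."""
    return math.log(1.0 - pct_lower / 100.0)


# Species-specific abundance: 29.3% lower (95% CI: 22.0 to 35.9%).
rr_mean = rr_from_percent_lower(29.3)
rr_upper = rr_from_percent_lower(22.0)  # smallest shortfall
rr_lower = rr_from_percent_lower(35.9)  # largest shortfall

# On the log scale the interval is close to symmetric around the mean,
# which is what one expects if the analysis was run on RRs.
ci_midpoint = (rr_upper + rr_lower) / 2.0
print(round(rr_mean, 3), round(ci_midpoint, 3))
```

The asymmetry of the reported percent interval around 29.3% largely disappears on the log scale, which is consistent with the assumed back-transformation.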
Fig. 1. Database overview. (A) The amount of paired data compiled into our
database for different combinations of plantations and native forests. For 
species-specific abundance, the amount of data is represented by the number 
of plantation-native forest pairs that supplied species-level RRs for entire 
ecological communities; for all other metrics, it is represented by the number of 
RRs. (B) Geographical distribution of RRs of different metrics, displayed in 
We next asked how well tree plantations
performed relative to restored native forests of
similar age (i.e., <10 years of age difference),
represented by secondary forests resulting
from natural regeneration, as well as actively
restored native forests resulting from the
planting of a diverse native tree mix (typically
≥50 species; Fig. 1A, fig. S4, and Fig. 2A,
lower panel) (18). With regard to environmental
performance, tree plantations performed
significantly more poorly than restored native
forests of similar age in species-specific abundance
(32.3% poorer; 95% CI: 15.7 to 45.7%;
there were insufficient data to contrast between
species groups; fig. S3) and marginally
so for soil erosion control (80.2% poorer;
-57.9 to 97.5%), but not aboveground biomass
(4.1% greater; -23.1 to 40.9% and spanning
zero; Fig. 2C, upper panel, and table S5;
data paucity precluded analysis for water 
yield). The similarity in aboveground bio- 
mass appeared to be due to the strong per- 
formance of abandoned plantations, which 
seemed to outperform both monocultures 
and mixed plantations (Fig. 2C, upper panel; 
although data paucity precluded formal anal- 
ysis on this). 

For wood production, the limited paired 
data showed that tree plantations had a clear 
advantage over restored native forests, with 
222.7% (105.8 to 406.0%) higher wood volume 



two maps for better visualization: species-specific abundance and aboveground 
biomass (upper panel) and soil erosion control, water yield, and wood production 
(lower panel). Bubble size in maps is proportional to the cube root of the 
amount of data for a given geographical location. * indicates that we did not 
compile paired wood production data for the comparison between tree 
plantations and reference native forests. 
at comparable age (Fig. 2C, lower panel, and
table S5; data paucity precluded analysis of 
profits from wood production). This advan- 
tage was apparent for both intensively man- 
aged and abandoned plantations, and regardless 
of whether wood volume included all woody 
species or only merchantable species (fig. S5). 
The same conclusion was reached by use of 
supplementary nonpaired data on annualized 
wood yields of restored native forests and 
various prominent monocultures: average annual
volume increments for restored native
forests were 61.3% (Welch two-sample t test:
t = -6.40, P < 0.0001) and 86.9% (t = -9.76,
P < 0.0001) lower than the lower and
upper bounds of the monocultures, respectively
(Fig. 2D).
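
For readers unfamiliar with the Welch two-sample t test used in this comparison, a minimal standard-library sketch follows. The two samples are made-up numbers for illustration only, not data from the study, and the function name is an assumption.

```python
import math
from statistics import mean, variance


def welch_t(sample_a, sample_b):
    """Return Welch's t statistic and Welch-Satterthwaite degrees of freedom.

    Unlike Student's t test, the two groups are not assumed to share a
    common variance.
    """
    n_a, n_b = len(sample_a), len(sample_b)
    se2_a = variance(sample_a) / n_a  # squared standard error, group A
    se2_b = variance(sample_b) / n_b  # squared standard error, group B
    t = (mean(sample_a) - mean(sample_b)) / math.sqrt(se2_a + se2_b)
    df = (se2_a + se2_b) ** 2 / (
        se2_a ** 2 / (n_a - 1) + se2_b ** 2 / (n_b - 1)
    )
    return t, df


# Hypothetical annual volume increments (m3 per hectare per year).
restored_native = [4.0, 5.5, 6.0, 3.5, 5.0]
monoculture_bound = [12.0, 15.0, 11.0, 14.0]

t_stat, dof = welch_t(restored_native, monoculture_bound)
print(t_stat, dof)
```

A negative t statistic here indicates a lower mean increment in the first sample, matching the direction of the comparison reported in the text.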

For all the above meta-analyses we found
high levels of heterogeneity (18), with I² (the
metric for heterogeneity) generally ≥80%
(table S5). Findings were robust to publication
bias (supplementary text; fig. S6) and various
sensitivity analyses related to weighting
schemes and model structure (18) (table S5).
They also showed that across the environ- 
mental metrics examined, tree plantations 
performed particularly poorly for soil erosion 
control (Fig. 2, right-hand panels). Because 
data for different metrics were obtained for 
different regions (Fig. 2, left-hand panels), 
the difference among environmental outcomes 
might reflect inherent biophysical differences 
among ecosystems. To address this potential 
Fig. 2. Relative performance of tree plantations versus native forests
across the metrics assessed. (A) Maps displaying the distribution and amount
of data analyzed, for three types of comparisons: plantations versus reference
native forests (upper panel), old (≥40 years of age) or abandoned plantations
versus reference native forests (middle panel), and plantations versus restored
native forests of similar age (i.e., <10 years of age difference; lower panel). As
with Fig. 1, bubble size is proportional to the cube root of the amount of data for
a given geographical location. (B) Relative performance of plantations versus
reference native forests (upper panel) and of old or abandoned plantations
versus reference native forests (lower panel), in environmental metrics.
Scattered dots in color represent RRs from primary studies across all types of
plantations, and diamonds and associated error bars represent the mean and
95% confidence intervals (CI), respectively, of RR values obtained from meta-analyses
in which the numbers of RRs are ≥10 (in the case of species-specific
abundance, in which the numbers of plantation-native forest pairs are ≥10). For
the comparison between plantations and reference native forests (upper panel),
we also analyzed RRs separately for different types of plantations for which
the numbers of RRs are ≥10. For these analyses, we display their RR values from
primary studies in gray, distinguishing among plantation types with different
symbols for their meta-analysis-derived means and 95% CI. (C) Relative
performance of plantations versus restored native forests of similar age in
environmental (upper panel) and production (lower panel) metrics, with symbol
use following that of (B). (D) Annualized wood volume increment of restored
native forests compared with the lower and upper bounds of the annual wood
increment of the world’s major monoculture plantations. In our display, we
differentiate between records on all woody plants and those on only
merchantable species for restored native forests, and between the lower and
upper bounds for plantations. In panels (B) and (C), scattered dots for species-specific
abundance data represent the average RR within the ecological
community concerned in each plantation-native forest pair, and a small number
of RR values are not displayed because they fell outside the display range,
including five highly negative RRs for soil erosion control indicated by * in (C).
geographical confounding effect, we next focused
on a subset of our database in which data
for different metrics could be geographically
matched to a given ecosystem type, the biophysical
conditions of which were largely coherent.
Overlaying our data onto the Holdridge
Life Zones map (22, 23), we identified “data
bundles” for each forest biome where RRs
were available for ≥2 metrics. In total, we
identified 11 such data bundles for the comparison
between tree plantations and reference native 
forests (Fig. 3A) and seven for the compari- 
son between tree plantations and restored 
native forests of similar age (Fig. 3B). The 


patterns of how RRs for soil erosion control 
compared with other environmental metrics 
within each data bundle corroborated our 
earlier findings: Relative to reference native 
forests, plantation shortfalls were almost always 
greatest for soil erosion control and least for 
water yield (Fig. 3). 
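
The heterogeneity metric I² mentioned earlier in this passage can be illustrated with the classical fixed-effect formulation based on Cochran's Q. This is a simplified sketch: the study's own models may be multilevel and may compute I² differently, and the effect sizes and sampling variances below are invented for illustration.

```python
def i_squared(effects, variances):
    """Return I² (percent) from Cochran's Q with inverse-variance weights.

    I² = max(0, (Q - df) / Q) * 100, where df = number of effects - 1.
    """
    weights = [1.0 / v for v in variances]
    pooled = sum(w * y for w, y in zip(weights, effects)) / sum(weights)
    q = sum(w * (y - pooled) ** 2 for w, y in zip(weights, effects))
    df = len(effects) - 1
    if q <= 0.0:
        return 0.0  # identical effects: no detectable heterogeneity
    return max(0.0, (q - df) / q) * 100.0


# Hypothetical log response ratios with equal sampling variances.
example_rrs = [-0.1, -0.5, -0.9]
example_vars = [0.01, 0.01, 0.01]
print(i_squared(example_rrs, example_vars))
```

Values at or above roughly 80%, as reported for these meta-analyses, mean that most of the observed variation among RRs reflects real differences between studies rather than sampling error.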
Fig. 3. Relative performance of plantations versus native forests compared
among the metrics assessed, based on geographically matched data 
bundles for individual forest biomes. (A) Plantations versus reference native 
forests. (B) Plantations versus restored native forests of similar age (i.e., 

<10 years of age difference). RR values (in the case of species-specific 
Fig. 4. Factors explaining the relative performance of plantations versus
reference native forests. Best models selected based on AICc scores identified
the following factors as explaining RRs: (A) plantation age for aboveground 
biomass and for species-specific abundance (* indicates the latter concerning 
the comparison between abandoned plantations and reference native forests 
only) and (B) MAP for water yield. Scattered dots represent RR values from 


842 20 MAY 2022 + VOL 376 ISSUE 6595 


[Fig. 3 legend: species-specific abundance; soil erosion control; water yield; boxplots show the 25%, 50%, and 75% quantiles.]


abundance, the average RR within the ecological community concerned in each 
plantation-native forest pair) are represented by scattered dots, and their 
quartiles by boxplots where the numbers of RRs are ≥5. For the comparison
between plantations and restored native forests of similar age, data bundles 
were not available for the four forest biomes on the top. 
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primary studies (in the case of species-specific abundance, average RR within 
the ecological community concerned in each plantation-native forest pair), with 
dot size proportional to the weight of each RR in the meta-regressions, 
standardized within each metric to the RR with the greatest weight. Fitted curves 
(black lines) and 95% confidence bands [colored
bands in (A) and a gray band in (B)] were generated from meta-regressions. 
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We also asked what factors might underlie 
the variation in environmental performance 
of tree plantations relative to native forests. 
For the comparisons of plantations versus re- 
ference native forests and plantations versus 
restored native forests of similar age, respec- 
tively, we assessed the relationship between 
RRs and a set of variables representing planta-
tion features and site biophysical conditions
[note that analyses of wood production were 
dropped because of data paucity (18)]. We
considered plantation type, plantation age 
(except for the comparison involving restored 
native forests of similar age), and mean annual 
temperature (MAT), measured in degrees 
Celsius (18). The rationale for considering
MAT was that by supporting higher plant 
diversity (24), warmer climates may show 
greater contrasts between plantations and 
native forests in terms of vegetation com- 
plexity, and in turn, in delivery of ecosystem 
services related to carbon, soil, and water (25). 
We also considered mean annual precipita- 
tion (MAP), in millimeters per year, for soil 
erosion because of its likely influence on pro- 
tective ground cover, as well as MAP and the 
seasonality of native forests (evergreen or de- 
ciduous) for water yield because of their likely 
influence on the hydrological behaviors of for- 
est ecosystems (18, 26, 27). 

The most parsimonious models selected 
through small-sample corrected Akaike In-
formation Criterion (AICc) scores (18) (table S6)
showed that increasing plantation age im- 
proved plantation performance relative to 
that of reference native forests in species- 
specific abundance and aboveground biomass 
(table S7), although such improvement was 
limited (Fig. 4A): particularly for aboveground
biomass, even old (≥40 years) plantations per-
formed less well than reference native forests.
Combined with the environmental shortfalls 
of old or abandoned plantations (Fig. 2B, lower 
panel), this finding suggests that old planta- 
tions no longer intended for productive use 
(28) would deliver environmental benefits 
more effectively if they were restored to native 
forest or native forest-like conditions. The fact 
that such areas are common in our database 
(Figs. 1A and 2A) indicates the sizeable envi- 
ronmental gains that such “forgotten lands” 
offer, underscoring the need to assess their global 
distribution and restoration potential (29). 
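
For readers unfamiliar with the small-sample correction, the standard formula is AICc = AIC + 2k(k + 1)/(n − k − 1), where k is the number of estimated parameters and n the number of observations. The sketch below applies it to three candidate models. The model names, log-likelihoods, and sample size are hypothetical placeholders, not values from the meta-regressions reported here.

```python
def aicc(log_likelihood: float, k: int, n: int) -> float:
    """Small-sample corrected AIC: (2k - 2 lnL) + 2k(k + 1) / (n - k - 1)."""
    if n - k - 1 <= 0:
        raise ValueError("AICc is undefined when n <= k + 1")
    aic = 2 * k - 2 * log_likelihood
    return aic + (2 * k * (k + 1)) / (n - k - 1)


# Hypothetical candidates: (name, log-likelihood, number of parameters).
candidates = [
    ("intercept only", -52.0, 2),
    ("plantation age", -47.5, 3),
    ("plantation age + MAT", -47.1, 4),
]
n_obs = 40  # hypothetical number of RRs

scores = {name: aicc(ll, k, n_obs) for name, ll, k in candidates}
best = min(scores, key=scores.get)  # the lowest AICc is preferred
print(best, round(scores[best], 2))
```

With these placeholder numbers the extra MAT term improves the likelihood only slightly, so the penalty for the additional parameter leaves the simpler age-only model with the lowest score.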

We also found that increasing MAP (range 
covered by our data: 490 to 4210 mm y⁻¹)
predicted more positive RRs for water yield 
when comparing tree plantations with reference 
native forests (Fig. 4B; table S7), indicating 
greater plantation shortfalls in water provi- 
sioning in drier climates. Clearly, water- 
oriented forest restoration initiatives should 
reexamine the practice of establishing large 
areas of tree plantations in the world’s drier 
regions (30). We did not find evidence of 
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other variables explaining variation in RR 
values or for any variable explaining planta- 
tion performance relative to restored native 
forests of similar age (fig. S7 and table S6). 
These findings were again robust to various 
sensitivity analyses related to weighting 
schemes and model structure (18) (tables S6 
and S7), with the exception of one sensitivity 
analysis on soil erosion control (table S6), 
which showed ameliorated plantation short- 
falls relative to reference native forests in 
warmer or wetter climates (fig. S7 and table S7). 

Our findings have implications for forest 
restoration as it is scaled up globally (7) and 
provide a knowledge base for exploring how 
outcomes can be best delivered by alternative 
restoration approaches. We found that restor- 
ing native forests typically delivers greater— 
and certainly no less—environmental benefits 
than establishing tree plantations in terms 
of biodiversity conservation and such key 
ecosystem services as aboveground carbon 
storage, soil erosion control, and water pro- 
visioning. However, delivering these outcomes 
will typically result in trade-offs with wood 
production because of the yield advantage 
of plantations over restored native forests 
(31-33), as measured in wood volume (distinct 
from aboveground carbon storage, which in 
addition to wood volume also factors in wood 
density). 

These findings provide evidence that if the 
goal of forest restoration is to recover environ- 
mental services on the land being restored and 
if wood production is not a primary concern, 
native forest restoration should be prioritized 
through use of site-appropriate measures in- 
cluding unassisted and assisted natural regen- 
eration and active planting of diverse native 
species (34-36). Beyond biodiversity, the stakes 
are especially high for soil erosion control given 
its far poorer delivery by tree plantations rela- 
tive to native forests. Our synthesis refutes the 
implicit assumptions of ecosystem service- 
oriented forest restoration initiatives such as 
China’s Grain-for-Green Program, which covers 
>34 million hectares (37, 38), and a large col- 
lection of projects targeting carbon storage 
(39), soil conservation (40), and water provi- 
sioning (41) that have focused mostly on estab- 
lishing (monoculture) tree plantations. 

However, where the goals of forest restora- 
tion include wood production, decision makers 
must navigate the trade-off between environ- 
mental and production outcomes (42). Beyond 
weighing competing goals and adopting res- 
toration approaches accordingly (43), larger- 
scale land-use planning must be invoked to 
also consider the “leakage” of forgone produc- 
tion to land parcels elsewhere; such leakage 
could alter—and even reverse—the overall en- 
vironmental gains of forest restoration (44). 
Ensuring environmental gains while meeting 
production goals under forest restoration 


hinges on understanding their trade-offs for a 
range of restored forest covers, making the 
acquisition of such information an urgent 
research priority. 

Interpretation of our results and associated 
policy recommendations raises three additional 
issues. First, although the environmental met- 
rics assessed were our best choices given data 
availability (18), they each characterize one
aspect of a focal outcome. For example, beyond 
aboveground biomass, an assessment of forest 
carbon storage must also consider carbon 
stored belowground (45) as well as in long- 
lived wood products. Second, because our data 
came from established tree covers, they repre- 
sent achievable outcomes of successful forest 
restoration (13). In reality, restoration approaches 
and outcomes are often constrained by factors 
such as funding limitations, recurrent distur- 
bances, livelihood needs, and regeneration 
stochasticity, among others (46, 47). Third, 
although we used paired data and accounted 
for the rigor of site matching in our analyses 
(18), we cannot rule out the potential influence
of preexisting site differences incurred by
land-use history (13) and species turnover
across space (betadiversity) (48), both of which 
are often difficult to ascertain. 

By presenting a global comparison between 
tree plantations and native forests that simul- 
taneously assesses their impacts related to 
biodiversity, climate, soil, water, and wood 
production based on rigorously paired data, 
our study provides insights into the alignment 
among these environmental goals and the 
trade-offs between environmental and pro- 
duction goals under forest restoration. Pre- 
vious research on the cobenefits of forest 
restoration has focused on “where to restore” 
(29, 49); by addressing “how to restore,” our 
study will help to improve the realism of 
future spatial prioritization efforts. Finally, 
other forest restoration outcomes such as 
food and nutrition security will be important 
in some contexts (50). Future research should 
address how these outcomes may fare under 
different restoration approaches, as well as 
their cobenefit opportunities and unavoidable 
trade-offs with other environmental and pro- 
duction goals. 
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KINETOCHORE 


Structure of the human inner kinetochore bound 
to a centromeric CENP-A nucleosome 


Stanislau Yatskevich†, Kyle W. Muir†, Dom Bellini, Ziguo Zhang, Jing Yang, Thomas Tischer,
Masa Predin, Tom Dendooven, Stephen H. McLaughlin, David Barford* 


Kinetochores assemble onto specialized centromeric CENP-A (centromere protein A) nucleosomes (CENP-ANuc)
to mediate attachments between chromosomes and the mitotic spindle. We describe cryo-electron
microscopy structures of the human inner kinetochore constitutive centromere associated network (CCAN)
complex bound to CENP-ANuc reconstituted onto α-satellite DNA. CCAN forms edge-on contacts with
CENP-ANuc, and a linker DNA segment of the α-satellite repeat emerges from the fully wrapped end of the
nucleosome to thread through the central CENP-LN channel that tightly grips the DNA. The CENP-TWSX 
histone-fold module further augments DNA binding and partially wraps the linker DNA in a manner reminiscent 
of canonical nucleosomes. Our study suggests that the topological entrapment of the linker DNA by CCAN 
provides a robust mechanism by which kinetochores withstand both pushing and pulling forces exerted by the
mitotic spindle.


The centromere is a specialized genetic
locus that interacts with the mitotic spin-
dle to facilitate chromosome segrega-
tion. Centromeres of humans and other
primates are made of multiple copies of
the conserved ~171-base pair (bp) α-satellite
repeat that contains the histone H3 variant
centromere protein A (CENP-A) (1, 2). Kinet-
ochores assemble specifically onto centromeres
to form large macromolecular machines that 
directly mediate chromosome segregation (3). 
A critical function of the kinetochore is to 
generate a load-bearing attachment between 
chromosomes and the spindle apparatus. 
How this function is achieved at the molecular 
level is unclear. 
Throughout the cell cycle, CENP-A nucle- 
osomes specify the constitutive association of 
the 16-subunit inner kinetochore constitutive 
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centromere associated network (CCAN) com- 
plex with centromeric chromatin, whereas the 
10-subunit KMN network of the outer kineto- 
chore assembles onto CCAN during mitosis 
to couple CCAN to microtubules for spindle- 
mediated chromosome movement. Two hu- 
man CCAN proteins, CENP-C and CENP-N, 
directly interact with CENP-ANuc (4, 5). CENP-C
interacts with the C terminus of CENP-A and
histones H2A and H2B (6), whereas CENP-N
binds the L1 loop and adjacent DNA gyre of
CENP-ANuc (7-17). The 17-bp B-box motif that
interacts with CENP-B is the only DNA se-
quence within the α-satellite repeat that is spe-
cifically recognized by a centromere-associated
protein. 

In budding yeast, a single kinetochore com- 
plex associates with a point centromere ac- 
commodating a sole CENP-A nucleosome. By 
contrast, the regional kinetochores of humans 
and other metazoans are generated from multi- 
ple copies of the CCAN and KMN network 
complexes to form large disk-like structures. 
In this work, we determined the cryo-electron 
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microscopy (cryo-EM) reconstruction of a 
CCAN-CENP-ANuc complex using a native
171-bp α-satellite repeat DNA, representing
the repeating modular unit of a human inner
kinetochore.

Fig. 1. CCAN is assembled from a network of interdependent modules. (A and
B) Cryo-EM density map (A) and molecular architecture (B) of the apo-CCANΔCT
fitted into the cryo-EM map with resolved CENP-QUFoot and CENP-HIKHead. (C) Atomic
model of the apo-CCANΔCT with details of the CENP-N-CENP-M interface as well as
CENP-O binding to the CENP-IK pocket shown as insets. (D) Cartoon schematic of the
CCANΔCT modules. Single-letter abbreviations for the amino acid residues are as
follows: A, Ala; C, Cys; D, Asp; E, Glu; F, Phe; G, Gly; H, His; I, Ile; K, Lys; L, Leu; M, Met;
N, Asn; P, Pro; Q, Gln; R, Arg; S, Ser; T, Thr; V, Val; W, Trp; and Y, Tyr.


CCAN is assembled from a network of
interdependent modules


Human CCAN comprises four modules (CENP-
LN, CENP-HIKM, CENP-OPQUR, and CENP-
TWSX) that are additionally linked by the
largely disordered CENP-C through its interac-
tions with CENP-LN and CENP-HIKM (7, 12, 13).
We expressed and purified CCAN subcom-
plexes and used the N-terminal half of CENP-C
(CENP-CN) that contains all known CCAN
binding sites and the central domain essential
for centromere localization (14, 15) (fig. S1A).
Phosphoproteomic analysis indicated an in-
terphase phosphorylation state of CCAN, as
judged by the absence of cyclin-dependent
kinase 1 (CDK1)-phosphorylation sites on
CENP-T (3). Initially, we focused on the 11-
subunit CCAN core complex (CENP-OPQUR-
LN-HIKM), which could assemble without
CENP-CN (termed CCANΔCT) (fig. S1B). We de-
termined the cryo-EM structure of CCANΔCT at
3.2-Å resolution (Fig. 1; fig. S2, A to D; and
table S1). The cross-linked structure was iden-
tical to a non-cross-linked reconstruction ob-
tained at lower resolution (fig. S2E). We also
determined crystal structures of the CENP-
OPQUR and CENP-HIKHead modules (fig. S2F
and table S2).

Fig. 2. Structure of the CCANΔT-CENP-ANuc complex. (A) (Right)
Consensus CCANΔT-CENP-ANuc cryo-EM map (transparent white)
overlaid onto the composite CCANΔT-CENP-ANuc cryo-EM density map
based on individual cryo-EM maps for the CENP-ANuc-CENP-CN and
CCANΔT-DNA reconstructions. (Left) Composite map alone. (B) Two
orthogonal views of the CCANΔT-CENP-ANuc complex depicted in cartoon
representation for protein and space filling for DNA. CENP-ANuc has a
diameter of 11 nm.

The architecture of CCANΔCT resembles bud-
ding yeast CCAN (ScCCAN) (16, 17) with CENP-
OPQUR and CENP-HIKM lobes assembled on
either flank of a central CENP-LN module
(Fig. 1). The arc-like CENP-LN module is the
structural keystone of CCANΔCT and generates
a deep channel that is extended on one side by
the N-terminal coiled-coil of CENP-QU. The
RWD domains of CENP-O and CENP-P, with
the C-terminal regions of CENP-QU and
CENP-R, organize into a cap domain above
CENP-N. In human CCANΔCT, CENP-R partially


Fig. 3. The extranucleosomal linker DNA is gripped by the CENP-LN channel. (A) Cartoon representation of CCANΔT-CENP-ANuc with cryo-EM density shown for
extranucleosomal linker DNA (from the CCANΔT-DNA cryo-EM density map). (B) The CENP-LN module features a positively charged channel that complements the DNA duplex.
Electrostatic surface charge is shown for CENP-LN. (C and D) Details of the CENP-LN channel-DNA interaction. R169 of CENP-N inserts into the DNA minor groove. CENP-LN
surface charge is shown in (C). (E) Cartoon schematic of the CCANΔT modules showing DNA path. (F and G) CENP-C binds to two sites on CCAN. The DEFxIDE motif binds
to CENP-LN (F), whereas the FxxLFL motif binds to CENP-HIKM (G).
Fig. 4. Architecture of the full CCANΔC-DNA complex. (A and B) Cryo-EM density map (A) and molecular model (B) of the CCANΔC-DNA complex with DNA map density
shown in yellow. (C) CCAN assembles an enclosed chamber that topologically entraps linker DNA. (D) Electrostatic representation of the DNA binding chamber formed by
CENP-LN, CENP-TW, and CENP-HIKHead modules shows a positively charged chamber that tightly grips DNA. (E) Cartoon schematic of CCAN showing DNA path through the
CENP-LN, CENP-HIKM, and CENP-TWSX modules. (F) The CENP-TWSX module resembles the canonical H3-H4 nucleosome tetramer and partially wraps DNA.
substitutes for yeast Nkp1 and Nkp2. CENP-M,
which is also specific to vertebrate CCAN,
adopts a pseudo-guanosine triphosphatase
(GTPase) fold and forms a stable complex with
CENP-HIK (18). CENP-M makes extensive con-
tacts with the HEAT domain of CENP-I and
the N-terminal helical bundle of CENP-H and
CENP-K and, in the context of CCAN, with
CENP-L (Fig. 1). CENP-M has a small deletion
that removes the switch I region and an ad-
jacent β strand (18). A conserved β strand of
CENP-N occupies this position to augment the
central β sheet of CENP-M (Fig. 1C). An N-
terminal α helix of CENP-O, connected through
a flexible linker to CENP-OPQUR, interacts
with a conserved pocket formed by CENP-I
and CENP-K (Fig. 1C). Disrupting this interac-
tion by deleting the N-terminal 35 residues of
CENP-O destabilizes CCANΔCT assembly, as
does mutating residues on CENP-N that bind
CENP-M (fig. S3, A and B). Further supporting
the human CCAN architecture described here 
is its consistency with the interdependent lo- 
calization of CCAN subunits to centromeres 
(12, 13, 18, 19). 
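
The subunit counts quoted in this section (an 11-subunit core and a 16-subunit CCAN) follow directly from the module names. The short Python tally below makes that arithmetic explicit; the dictionary layout and variable names are illustrative choices rather than anything defined in the study.

```python
# Module composition as named in the text; each letter in a module name is one subunit.
MODULES = {
    "CENP-LN": ["CENP-L", "CENP-N"],
    "CENP-HIKM": ["CENP-H", "CENP-I", "CENP-K", "CENP-M"],
    "CENP-OPQUR": ["CENP-O", "CENP-P", "CENP-Q", "CENP-U", "CENP-R"],
    "CENP-TWSX": ["CENP-T", "CENP-W", "CENP-S", "CENP-X"],
}

# Core complex described in the text: CENP-OPQUR-LN-HIKM (no CENP-C, no CENP-TWSX).
core_modules = ("CENP-OPQUR", "CENP-LN", "CENP-HIKM")
core_count = sum(len(MODULES[name]) for name in core_modules)

# Full CCAN: all four modules plus the linking subunit CENP-C.
full_count = sum(len(subunits) for subunits in MODULES.values()) + 1

print(core_count, full_count)
```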

Two peripheral and conformationally flex- 
ible regions of CCAN are poorly resolved in the 
cryo-EM reconstruction: the CENP-HIKHead
and N-terminal region of CENP-QU (CENP-
QUFoot) (Fig. 1). Focused three-dimensional
(3D) classification of CENP-HIKHead identi-
fied multiple poses, two of which readily ac-
commodate the CENP-HIKHead crystal structure
(figs. S2F and S3C).


Reconstitution of CCAN-CENP-ANuc onto a
171-bp α-satellite repeat


To assemble a complete CCANΔT-α-satellite
nucleosome complex, we reconstituted CCANΔT
complexes with CENP-CN and CENP-ANuc using
171-bp α-satellite repeat DNA (CENP-ANuc-171).
The α-satellite DNA matches an α-satellite
repeat from human chromosome 2 and shares
93% sequence identity with at least 14 other
human centromeres (20) (fig. S4A). We used
two separate 171-bp α-satellite sequences that
differed in register by 22 bp (referred to as AS1
and AS2) (fig. S4A). To define the exact nu-
cleosome positioning of the AS2 sequence, we
determined a cryo-EM structure of a CENP-
ANuc-CENP-CN complex at 2.4-Å resolution
(fig. S4, B to D, and table S1). The dyad axis of
the nucleosome is equivalent to nucleosome
core particles (NCPs) containing X chromo-
some α-satellite DNA, with which it shares
71% sequence identity (10, 27) (fig. S4A). This
also matches nucleosome positions defined by
in vivo chromatin immunoprecipitation (ChIP)
micrococcal nuclease sequencing (MNase-seq)
analyses of human centromeric chromatin
(21, 22). Relative to histone H3 nucleosomes
(H3Nuc), CENP-ANuc exhibits continuous con-
formational flexibility of its terminal DNA seg-
ments as shown by a 3D variability analysis.
The DNA ends fluctuate between being fully wrapped
at both ends and being unwrapped by ~22 bp
at one end, with most particles in interme-
diate states (fig. S4E and movie S1). Un-
wrapped and flexible terminal DNA segments
are seen in yeast and vertebrate CENP-ANuc
structures (17, 23), and this correlates with a
110- to 120-bp nuclease-resistant DNA core
(17, 23). These in vitro studies are also in
concordance with CENP-ANuc protecting 110
to 150 bp of α-satellite DNA based on in vivo
ChIP-MNase-seq studies of natural human
centromeres and neocentromeres as well as
observations that CENP-ANuc is a canonical
octameric nucleosome in cells (21, 24, 25). A
structure of CENP-ANuc reconstituted with AS1
(from the CCANΔT-CENP-ANuc-AS1 cryo-
EM dataset) was positioned identically to
CENP-ANuc-AS2, except that the 3′ unwrapped
end is absent because of the 22-bp 5′ register

Fig. 5. Structure of the complete CCAN-CENP-ANuc inner kinetochore module. (A) Atomic model of the CCAN-CENP-ANuc complex. (B) Cartoon schematic of
the CCAN-CENP-ANuc complex. (C) Mutations of CENP-N and CENP-TW previously shown to impair centromere function contribute to the DNA binding channel of the
CCAN (4, 7-9, 19, 29). Residues implicated in DNA binding by CENP-L (and mutated in the current study) are also shown.
shift relative to AS2 (fig. S4A). In addition
to resolving the CENP-ANuc, we were able
to build the CENP-C central domain inter-
acting with the C terminus of CENP-A, and
histones H2A and H2B, a structural feature
of CENP-ANuc recognition conserved from
yeast to humans (6, 10, 11, 17).
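
To make the idea of two 171-bp sequences that "differ in register" concrete, the sketch below cuts two monomer-length windows out of a tandem array, offset by 22 positions. It also includes a naive percent-identity helper for equal-length, ungapped sequences. The monomer is a synthetic placeholder string, not a real α-satellite sequence; the direction of the shift is arbitrary; and the identity values quoted in the text would normally come from a proper alignment rather than this simplified helper.

```python
def repeat_window(monomer: str, shift: int) -> str:
    """Return a monomer-length window starting `shift` positions into a tandem array."""
    n = len(monomer)
    shift %= n
    return (monomer + monomer)[shift:shift + n]


def percent_identity(a: str, b: str) -> float:
    """Percent identical positions between two equal-length, ungapped sequences."""
    if not a or len(a) != len(b):
        raise ValueError("sequences must be non-empty and of equal length")
    return 100.0 * sum(x == y for x, y in zip(a, b)) / len(a)


# Synthetic 171-nt placeholder; NOT a real alpha-satellite sequence.
monomer = "".join("ACGT"[(i * i + i // 3) % 4] for i in range(171))

window_a = repeat_window(monomer, 0)   # one register
window_b = repeat_window(monomer, 22)  # same repeat, register shifted by 22 bp

# Both windows contain the same 171 bp of the repeat; only the boundaries differ.
print(len(window_a), len(window_b), window_b[:149] == window_a[22:])
```

The two windows carry identical sequence content, so any difference between constructs built this way lies in where the DNA ends fall relative to the nucleosome.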


The positively charged CCAN channel grips
linker α-satellite DNA of CENP-ANuc


We then determined cryo-EM structures of 
CCANΔT-CENP-ANuc complexes using both
the AS1 and AS2 DNA (figs. S5 and S6). These 
two structures were identical except for their 
DNA boundaries (figs. S4A and S6B). Because 
the AS1 sequence produced a more stable 
complex, as judged by size exclusion chroma- 
tography (SEC), a larger cryo-EM dataset was 
collected with this DNA (table S1). 

Cryo-EM micrographs of the CCANΔT-CENP-
ANuc datasets and 2D-class averages showed
particles corresponding to three species: (i)
CENP-ANuc-CENP-CN, (ii) CCANΔT bound to
free DNA, and (iii) CCANΔT bound to CENP-
ANuc (fig. S5, A to C). The observation of one
CCANΔT associated with CENP-ANuc is in agree-
ment with an assessment of the oligomeric
state of CCANΔT-CENP-ANuc in solution, which
indicates a single CCANΔT assembled onto
CENP-ANuc (fig. S5D).

In the CCANΔT-DNA complex, determined
at 4.5-Å resolution, EM density is visible for a
linear ~24-bp DNA duplex threading through
the CENP-LN channel (Fig. 2A, Fig. 3A, and
fig. S5E). The CCANΔT-CENP-ANuc complexes
exhibited conformational heterogeneity, lim-
iting the resolution of a consensus cryo-EM
reconstruction to 8.9 Å (Fig. 2A and figs. S5
and S6). Focused 3D classification and refine-
ment of the CCANΔT-DNA component alone
produced a 7.3-Å-resolution reconstruction
with a protein and DNA structure identical
to that of the CCANΔT-DNA complex (fig. S5,
C and E). We therefore generated a com-
posite CCANΔT-CENP-ANuc structure from the
higher-resolution CENP-ANuc-CENP-CN and
CCANΔT-DNA reconstructions (Fig. 2A). In the
CCANΔT-CENP-ANuc complex, the CENP-ANuc
DNA is wrapped similarly to the free CENP-
ANuc-CENP-CN (fig. S6).

A notable feature of the CCANΔT-CENP-
ANuc complex is that CCANΔT forms few direct
contacts with CENP-ANuc, but instead its pri-
mary contact with CENP-ANuc is through the
extended extranucleosomal linker DNA. CENP-
ANuc forms a small end-on contact, mediated
through its DNA gyre, with the back-face of
CCANΔT (Fig. 3, B and C). This positions the
extranucleosomal DNA to thread through the
CENP-LN channel. Clearly resolved cryo-EM
density for the DNA phosphate backbone
shows it tightly gripped by conserved basic
residues of the CENP-LN channel (fig. S5E).
These create a dense, continuous, and posi-
tively charged surface complementing the
shape and charge of the 24-bp linker DNA
duplex (Fig. 2 and Fig. 3). Interactions of
the side chains of CENP-N residues K148,
M167, and R169 with the DNA minor groove
contribute to defining a fixed DNA register
(Fig. 3, C and D). In the DNA-bound state,
the CENP-LN channel contracts, tightening
its grip on the DNA duplex (fig. S5F), and
the channel is extended by the apposition of
CENP-HIKHead in the raised conformation
(Fig. 3, A and E).


CENP-C interaction sites on CENP-LN and 
CENP-HIKM modules 


CENP-C is required for the centromere local- 
ization of CENP-LN, CENP-T, CENP-O, and 
the CENP-HIKM module (5, 7, 12, 13, 18). The 
CCAN-interacting region of CENP-C maps to 
the PEST sequence (CENP-CPEST), a 200-residue
intrinsically disordered region N-terminal of 
the CENP-C central domain that features two 
highly conserved short linear sequence motifs 
(7, 13, 14). Cryo-EM maps of the CCANΔT-DNA
and CCANΔT-CENP-ANuc complexes indicated
two volumes of unassigned map density not
present in apo-CCANΔCT and CCANΔC-DNA
complexes (discussed below), associated with 
the CENP-HIKM and CENP-LN modules (fig. 
S6D), previously shown to interact with CENP-C 
(7, 12-14). To guide the interpretation of these 
densities, we used AlphaFold2 (26) to predict 
potential models of how CENP-CPEST would
interact individually with CENP-LN and CENP-
HIKM. The first run predicted that the con-
served DEFxIDE motif (residues 301 to 307)
(7, 12-14) of CENP-CPEST would form an edge
β strand with the CENP-N β sheet (Fig. 3F).
This position corresponded well to additional 
cryo-EM density associated with CENP-N (fig. 
S6D). Phe and Ile residues of the DEFxIDE 
motif dock into a hydrophobic pocket at the 
CENP-LN interface, whereas the flanking acidic 
residues form electrostatic interactions with 
conserved Arg and Lys residues of CENP-N 
and CENP-L. In support of this model, a prior 
study mutating the EFxID residues of the 
CENP-C DEFxIDE motif abolished its inter- 
action with CENP-LN in vitro and disrupted 
CENP-N recruitment to centromeres in vivo 
without perturbing CENP-C localization (7). 
For CENP-HIKM, AlphaFold2 also predicted 
that the CENP-CPEST FxxLFL motif (residues
262 to 267) (13) would bind as an α helix to a
hydrophobic site at the combined CENP-H- 
CENP-K-CENP-M interface (Fig. 3G). This 
prediction was validated by cryo-EM density 
at this location (fig. S6D) and is consistent 
with a previous study that found that mutat- 
ing the LFL residues of this motif disrupts 
CENP-C interactions with CENP-HIKM in vitro 
and in vivo (13). Both the LN and HIKM inter- 
action sites are located on the back-face of 
CCAN, facing CENP-ANuc, which suggests a
mechanism for how CENP-C tethers kinet-
ochores to CENP-ANuc (5) (Fig. 3E).
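
The two short linear motifs named above, DEFxIDE and FxxLFL, use "x" to stand for any residue. As a minimal illustration of how such motif names map onto a sequence search, the sketch below translates them into regular expressions and reports 1-based residue ranges. The peptide is a made-up toy string, not the CENP-C sequence, and the helper name `find_motifs` is hypothetical.

```python
import re

# "x" in a motif name means any residue, which is "." in regular-expression syntax.
MOTIFS = {
    "DEFxIDE": re.compile(r"DEF.IDE"),
    "FxxLFL": re.compile(r"F..LFL"),
}


def find_motifs(sequence: str) -> dict:
    """Return {motif name: [(start, end), ...]} with 1-based inclusive residue numbers."""
    hits = {}
    for name, pattern in MOTIFS.items():
        # match.start() is 0-based and match.end() is exclusive, so end is already
        # the 1-based inclusive index of the last matched residue.
        hits[name] = [(m.start() + 1, m.end()) for m in pattern.finditer(sequence)]
    return hits


# Made-up toy peptide for demonstration only; NOT the real CENP-C sequence.
toy_peptide = "MSGFAALFLKKTTDEFKIDESS"
print(find_motifs(toy_peptide))
```

A pattern search of this kind only locates candidate motif occurrences; whether a match is functional depends on the structural and mutational evidence discussed in the text.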


α-satellite linker DNA is a crucial determinant
of stable CCAN association with CENP-ANuc


Our CCANΔT-CENP-ANuc structure suggests
that the interaction of the extranucleosomal
DNA duplex with the CENP-LN channel is a
major determinant of CCAN assembly onto a
centromeric α-satellite-CENP-ANuc. To test
this, we performed reconstitutions assessing the
role of the extranucleosomal DNA, CENP-CN,
basic residues lining the CENP-LN channel,
CENP-HIKHead, and CENP-QUFoot. We first
tested the requirement of the extranucleoso-
mal DNA. Using SEC, we found that CENP-
ANuc wrapped with 147 bp of α-satellite DNA
(10) did not form stable complexes with CCANΔCT
without CENP-CN, whereas a 171-bp α-satellite
that included the extranucleosomal DNA
formed stable complexes with CCANΔCT in-
dependently of CENP-CN (fig. S7, A and B).
Interactions with CENP-ANuc reconstituted
with a 171-bp α-satellite were equally stable
in the presence or absence of the B-box, con-
sistent with sequence-independent inter-
actions with the DNA phosphate backbone
(fig. S7C).

Having established a requirement for linker
DNA in the assembly of CCANΔCT-CENP-ANuc
complexes, we next assessed how modifica-
tions to the CENP-LN channel affected CCANΔCT
interactions with CENP-ANuc. In the absence
of CENP-CN, mutating three positive patches
of CENP-L (CENP-Lcm), which interact with
linker DNA and the nucleosome gyre (Fig. 3,
B to D, and fig. S5F), abolished binding of
CCANΔCT to CENP-ANuc-171 without perturb-
ing CCAN assembly (fig. S7D). Identical results
were obtained by deleting either CENP-
HIKHead or CENP-QUFoot (fig. S8, A and B).
These deletions also moderately reduced
CCAN binding to isolated DNA (fig. S8C). To
assess the validity of the CCAN-CENP-ANuc
model in a cellular context, we tested the
effect of the CENP-Lcm mutant that disrupted
CCANΔCT binding to CENP-ANuc-171 in vitro
on centromere localization of CENP-L in vivo.
In HEK293 cells, localization of the CENP-
Lcm mutant to kinetochores was substantially
impaired compared with wild-type CENP-L
(fig. S8D).

In previous structures of the CENP-N N-
terminal domain (CENP-NNT) associated with
CENP-ANuc (7, 8, 10), CENP-NNT interacts with
the L1 loop of CENP-A (CENP-A L1 loop) and
adjacent DNA gyre at SHL2-3. In agreement
with these structures, we found that mutat-
ing the CENP-A L1 loop disrupts the interaction of
the isolated CENP-LN dimer with CENP-ANuc
wrapped with 171-bp α-satellite DNA (fig. S9A).
By contrast, our structure of CCAN-CENP-
ANuc-171 does not involve an interaction of
CENP-NNT with the CENP-A L1 loop, and, consistent
with this, mutating the CENP-A L1 loop did not
disrupt interactions of CENP-ANuc-171 with
CCANΔCT, as assessed by both pull-down as-
says and during reconstitution on SEC (fig. S9,
A and B). These results, as well as previous
structural information (7-10), suggest that the
mode of interaction between CENP-LN and
CENP-ANuc is different when CENP-LN is part
of the full CCAN complex compared with
CENP-LN alone. To further test this hypothe-
sis, we performed competition assays where
we added CENP-NNT, which binds specifically
to the CENP-A L1 loop, at a fourfold excess over the
CCAN-CENP-ANuc complex (fig. S9C). The
CCAN-CENP-ANuc complex readily accom-
modated the additional CENP-NNT without
evident diminution of CCAN binding, and
this additional CENP-NNT incorporation was
dependent on the CENP-A L1 loop. This suggests
that in a fully assembled CCAN-CENP-ANuc
complex, the CENP-A L1 loop is not occupied and
is therefore available for CENP-NNT binding
(fig. S9D), consistent with our cryo-EM recon-
struction. Modeling a CENP-NNT-based inter-
action of CCAN with the CENP-A L1 loop shows that
CENP-ANuc would clash extensively with the
CENP-L, CENP-HIK, and CENP-OPQUR mod-
ules (as well as the CENP-TWSX module, as
discussed below) (fig. S9E). In both models,
CENP-NNT forms the same interactions with
the DNA backbone, although at SHL7-8 for
CENP-NNT in CCAN (fig. S9F). Thus, previ-
ously described mutants at the CENP-NNT-
DNA interface that disrupt CENP-N-centromere
localization in vivo (4, 7, 8) are consistent with,
but do not distinguish between, both modes of
CENP-N-CENP-ANuc interaction.

Overall, the interaction of free CENP-LN
with CENP-ANuc is dependent on the CENP-A L1 loop
but is independent of the linker DNA, whereas
CCAN interactions with CENP-ANuc are depen-
dent on the linker DNA but independent of
the CENP-A L1 loop. These results are consistent
with different modes of interaction between
CENP-LN and CENP-ANuc when CENP-LN is
incorporated into CCAN. This agrees with a 
previous observation that the L1 loop is not 
required for the in vitro assembly of a func- 
tional kinetochore (27). 


CENP-C confers selectivity of assembled 
CCAN for CENP-A mononucleosomes 


The only direct interaction between CCAN^ΔT and the CENP-A histone in our structure is
mediated by CENP-C^N. To validate this finding, we produced human H3 nucleosomes
reconstituted either with a 147-bp 601 sequence or a 601 sequence extended at the 5' end
by 30 bp (177 bp) (fig. S4A). As expected, CCAN^ΔT failed to bind H3^Nuc reconstituted
with the minimal 147-bp 601 sequence, whereas CCAN^ΔCT bound CENP-A^Nuc with the same
DNA only in the presence of CENP-C^N, similar to 147-bp α-satellite DNA (figs. S7A and
S10A). Consistent with our structure, binding was observed with the extended 177-bp
sequence for both H3^Nuc-177 and CENP-A^Nuc-177 (fig. S10B). The binding of CCAN^ΔCT to
CENP-A^Nuc-177, but not to H3^Nuc-177, was enhanced four- to fivefold by CENP-C^N,
consistent with selective recognition of CENP-A^Nuc by CENP-C (fig. S10C).


CENP-TWSX binds and partially wraps 
linker DNA 


The CCAN^ΔT-CENP-A^Nuc structure was reconstituted without the CENP-TWSX module because
of our initial difficulties incorporating CENP-TWSX into CCAN. Removing all affinity tags
from CENP-WSX enabled CENP-TWSX incorporation into CCAN (fig. S10D), and, as judged by
SEC, this 16-subunit holo-complex interacts with the 171-bp α-satellite-CENP-A^Nuc
similarly to CCAN^ΔT (fig. S10D), consistent with CENP-A, CENP-C, and CENP-T making up a
single 20-subunit inner kinetochore complex (28). The full CCAN complex also requires
extranucleosomal linker DNA to bind CENP-A^Nuc without CENP-C, which suggests that
CENP-TWSX does not change the fundamental mechanism of CCAN assembly onto CENP-A^Nuc
(fig. S10E). Electrophoretic mobility shift assay experiments revealed that CENP-TWSX
increases the affinity of CCAN^ΔCT for both CENP-A^Nuc and H3^Nuc nucleosomes containing
177-bp 601 DNA by approximately threefold, which suggests that it does not confer
additional selectivity for CENP-A^Nuc over canonical H3^Nuc (fig. S10, C and F).

To understand how CENP-TWSX contributes to CCAN recognition of DNA and CENP-A^Nuc, we
determined a cryo-EM structure of CCAN^ΔC with a 53-bp DNA linker segment (fig. S4A). We
obtained a well-resolved cryo-EM map at 2.8-Å resolution with all regions of CCAN and the
DNA clearly defined (Fig. 4, A and B; fig. S11; table S1; and movie S2).

The four histone-fold domains of CENP-TWSX resemble the histone H3-H4 tetramer and are
similar to crystal structures of isolated chicken CENP-TWSX (19). CENP-TWSX forms
multiple interactions with neighboring CCAN subunits and DNA (Fig. 4, B and C). Contacts
with a slightly repositioned CENP-HIK^Head are mediated by both CENP-T and the extended
N terminus of CENP-W. CENP-W also directly contacts the CENP-N pyrin domain. A
CENP-TWSX-mediated linkage of CENP-HIK^Head to CENP-N and CENP-QU^Foot creates an
enclosed chamber that topologically entraps the extranucleosomal DNA (Fig. 4, A to D).
The chamber comprises CENP-LN, CENP-HIK, and CENP-TW modules and is augmented and
rigidified by all other CCAN subunits (Fig. 4E). As in the CCAN^ΔT-DNA complex, the DNA
duplex inserts into the deep CENP-LN channel; however, CENP-TWSX and the repositioned
CENP-HIK^Head induce a marked curvature of the DNA. Notably, the DNA starts to wrap
around the CENP-TW histone-fold domains as it emerges from the CENP-LN channel in a
manner comparable to how a canonical H3-H4 tetramer wraps DNA (Fig. 4F). This DNA
binding is augmented by CENP-HIK^Head. Consequently, the DNA duplex engages a 50-Å-long
positively charged groove at the interface of CENP-I^Head and CENP-TW (Fig. 4D). Similar
to nucleosomes, the N terminus of the CENP-T histone-fold domain (HFD) inserts into the
minor groove of the extranucleosomal DNA, positioning the highly conserved R450 to
contact the DNA backbone (Fig. 4F). The CENP-LN channel and CENP-TW-CENP-I^Head groove
perfectly complement the DNA duplex, with the DNA phosphate backbone forming a
continuous interface with CCAN, burying ~976 Å² of DNA surface area.

To complete our structural analysis, we determined cryo-EM structures of CCAN in complex
with CENP-A^Nuc reconstituted with 171-bp α-satellite DNA (Fig. 5, A and B, and fig. S11,
C to E). CCAN assembles onto CENP-A^Nuc similarly to CCAN^ΔT. The nucleosome associates
with the back-face of CCAN, and the extranucleosomal DNA threads through the CENP-LN
channel. The linker DNA of the α-satellite repeat is gently curved and interacts with
CENP-TW-CENP-I^Head, reminiscent of the CCAN^ΔC-DNA complex. We note that the AS2
α-satellite DNA of 171 bp in length is ~15 bp too short to wrap all the way around the
CENP-TWSX module.

CENP-TW contributes to DNA binding through conserved basic residues that were previously
implicated in mediating DNA binding in vitro and, for CENP-W, are required for mitotic
progression and its recruitment to kinetochores in vivo. These functions of CENP-TW, and
the ability of CENP-TWSX to supercoil DNA, led to proposals that CENP-TWSX is a DNA
binding nucleosome-like particle, consistent with the curvature of the DNA in our
structure (19, 29). These residues of CENP-TW, together with basic residues of CENP-N,
whose mutation disrupts CENP-N association with CENP-A^Nuc in vitro and centromere
localization in vivo (4, 7-9), form a continuous spine lining the conserved CCAN-DNA
interface (Fig. 5C).

Many vertebrate cell lines are viable and exhibit normal chromosome segregation without
the CENP-OPQUR module (12, 30), which contrasts with our finding that deletion of the
CENP-QU^Foot abrogates CCAN^ΔCT binding to CENP-A^Nuc (fig. S8, B and C). A likely
explanation for this is the absence of the CENP-TWSX module in CCAN^ΔCT, which
contributes to DNA binding (Fig. 4 and Fig. 5, A and B). Consistent with this, we found
that in the presence of CENP-TWSX, removing the entire
CENP-OPQUR module (CCAN^ΔO) caused only a small decrease in the affinity for both DNA
and CENP-A^Nuc (fig. S12, A and B). Additionally, CCAN^ΔO readily formed a complex with
CENP-A^Nuc (fig. S12C). These in vitro results support the idea that CENP-OPQUR
contributes relatively little to the attachment of vertebrate kinetochores to chromatin.


Discussion 


The absence of both mitotic phosphorylation and the outer kinetochore suggests that the
structure we describe likely reflects an interphase kinetochore. A cryo-EM structure of
human apo-CCAN (31) agrees with our CCAN reconstructions, and the proposed model of DNA
engagement by the CENP-LN channel is shown from our CCAN-CENP-A^Nuc complex to represent
extranucleosomal linker DNA. Defining the structure of the holo-kinetochore at mitosis
remains an outstanding challenge, and it will be important to further validate the
cryo-EM structures in cells.

The notably similar features of DNA duplex recognition by the basic CENP-LN channel of
HsCCAN and ScCCAN (17) suggest a common evolutionary origin of centromere-kinetochore
assembly. In humans, however, the DNA duplex engaged by CCAN is the α-satellite linker
DNA rather than the unwrapped CENP-A^Nuc DNA of budding yeast. CCAN engages ~40 bp of
extranucleosomal DNA, which could be facilitated either by the unwrapping of ~16 bp of
the upstream CENP-A^Nuc (consistent with our structures showing DNA unwrapping in
CENP-A^Nuc) and/or by the sparse occupancy of CENP-A^Nuc on centromeric DNA (32). Such
organization is also consistent with evidence that CENP-C and CENP-T bridge adjacent
nucleosomes (28). The topological entrapment of linker DNA provides a molecular
explanation for how the inner kinetochore can withstand strong pushing and pulling
forces applied by the mitotic spindle, a central function of this large macromolecular
assembly. Our work also confirms CENP-C as a key determinant of CCAN specificity for
CENP-A^Nuc, providing a molecular explanation for how this essential CCAN component
directly links the inner kinetochore to centromeric chromatin.

Our finding that CCAN can bind to either canonical or CENP-A nucleosomes bearing
adequate linker DNA could provide an explanation for how the inner kinetochore maintains
attachments at centromeres where H3^Nuc vastly exceeds CENP-A^Nuc across the entire
centromeric α-satellite repeats (33). Furthermore, it could explain why synthetic
depletion of CENP-A once CCAN is assembled onto centromeres nevertheless permits mitotic
centromere function in human cells (34). Salt extraction of CENP-A does not reduce CENP-N
and CENP-C levels in centromeric chromatin (35), which suggests that once CCAN is
assembled at centromeres in a CENP-A-dependent process (36, 37), CENP-A is dispensable
for localization. Lastly, assembly of CCAN onto linker DNA provides a possible
explanation for how some organisms that lack CENP-A, yet still depend on CCAN for
chromosome segregation, might function (38).

An emerging question from this study is how human CCAN localizes specifically to the
centromere given its modest preference for CENP-A^Nuc mononucleosomes compared with
H3^Nuc (33). One explanation is that the CENP-LN module is selectively recruited to the
centromere, through recognition of the CENP-A L1 loop, during early stages of the cell
cycle and is then displaced to form a fully assembled CCAN, thus confining kinetochore
assembly to a specific chromatin locus. CENP-LN levels at the centromere decline upon
mitotic entry (10, 39), which supports this sequential and specific localization
mechanism. CDK-mediated CENP-C phosphorylation (17, 40) and/or chromatin compaction (39)
might regulate this process. CENP-B recognizes the B-box motifs of the α-satellite repeat
and interacts with CENP-C, which provides another possible mechanism for specific CCAN
recruitment (41). Different degrees of wrapping of CENP-A^Nuc in centromeric arrays
compared with H3^Nuc might also provide selectivity for CCAN assembly, as could the H1
histone that would block access of CCAN to the linker DNA at H3^Nuc. Our structure
permits a model for the arrangement of CCAN within the centromeric arrays of regional
kinetochores (fig. S12D). This study also highlights the common mechanisms of
topological entrapment used by biological systems responsible for accurately mediating
chromosome segregation, such as the cohesin complex that, likewise resisting vast forces
exerted by the spindle, maintains cohesion between sister chromatids.

FOSSIL RECORD


Global record of “ghost” nannofossils reveals
plankton resilience to high CO2 and warming


Sam M. Slater, Paul Bown, Richard J. Twitchett, Silvia Danise, Vivi Vajda


Predictions of how marine calcifying organisms will respond to climate change rely heavily on the fossil 
record of nannoplankton. Declines in calcium carbonate (CaCO3) and nannofossil abundance through 
several past global warming events have been interpreted as biocalcification crises caused by ocean 
acidification and related factors. We present a global record of imprint (or “ghost”) nannofossils that
contradicts this view, revealing exquisitely preserved nannoplankton throughout an inferred Jurassic 
biocalcification crisis. Imprints from two further Cretaceous warming events confirm that the fossil 
records of these intervals have been strongly distorted by CaCO3 dissolution. Although the rapidity of 
present-day climate change exceeds the temporal resolution of most fossil records, complicating 
direct comparison with past warming events, our findings demonstrate that nannoplankton were more 
resilient to past events than traditional fossil evidence suggests. 


As CO2 levels in the atmosphere rise, resultant ocean acidification (OA) and declining
seawater carbonate ion concentrations will likely make it more difficult for marine
organisms to form their calcium carbonate (CaCO3) skeletons or shells (1, 2).
Coccolithophores, a group of unicellular phytoplanktonic algae (also known as
nannoplankton), are the most productive marine calcifiers (3), but predicting their
response to future environmental change has proven challenging. Experiments testing the
effects of high CO2 and temperatures on living coccolithophores and their calcitic
exoskeletons have shown apparently contradictory results within and between species
(4-8). However, interpretations of geological and fossil-based evidence have evoked
globally catastrophic responses in nannoplankton during past intervals of high
temperature and CO2. Specifically, prominent studies have observed substantial declines
in CaCO3 and nannoplankton abundance through past global warming events, especially
throughout the Mesozoic oceanic anoxic events (OAEs), and have interpreted these signals
as biocalcification crises, whereby OA, and related environmental change, directly
compromised biogenic CaCO3 production (reviewed in data S1 to S3). Conversely, others
have argued that these declines in CaCO3 are caused by dissolution of carbonate at the
seafloor during these warming events and that independent evidence for nannoplankton
responses to OA must be better understood and demonstrated before biocalcification
crises are invoked (9, 10). Given that the biocalcification crisis paradigm continues to
be widely applied to past warming episodes (data S1 to S3) and that this model predicts
potentially disastrous changes to future marine biodiversity and carbon cycle function,
we tested this hypothesis using a novel methodology.

We examined the biocalcification crisis associated with the Toarcian OAE [T-OAE; ~183
million years ago (Ma)] in the Early Jurassic, considered one of the most severe such
events of the past 200 million years. The T-OAE was a geologically rapid global warming
event caused by volcanism in the Southern Hemisphere (11) and is characterized by a
range of environmental, geological, and ecological changes, including high CO2, OA,
oceanic anoxia, the deposition of organic-rich sediments, a major negative carbon
isotope excursion, and widespread extinction [(12) and references therein]. Previous
interpretations of the biocalcification crisis are primarily based on declining CaCO3
in the sedimentary rock record and decreased nannoplankton species abundances and sizes
(data S1), but this evidence is dependent on conventional nannofossil analyses, whereby
data are derived from calcite “body” fossils. Here we report on an overlooked form of
preservation, namely imprint (or “ghost”) nannofossils, which provides critical
information that may be lost from the more routinely studied body fossil record.

Toarcian rock samples from the UK, Germany, 
Japan, and New Zealand (fig. S1) were pro- 
cessed for organic matter analysis. Scanning 
electron microscopy (SEM) of organic par- 
ticles revealed nannoplankton imprints pre- 
served on the surfaces of marine organic-walled 
plankton (dinoflagellate cysts, prasinophytes, 
and acritarchs), amorphous organic matter
(AOM), and spores and pollen from land plants
(Fig. 1). Imprints are also visible with the use 
of transmitted light, fluorescence, and con- 
focal microscopy (fig. S2), but fine details are 
evident when SEM is used, with specimens 
displaying diagnostic coccolith rims, radial 
and imbricating sutures, and fragile axial and 
radiating central structures (figs. S3 to S12). 
Preservation is often pristine, and digital in- 
version of imprint images provides “virtual 
casts” that assist in visualization and identi- 
fication of the original nannofossils (Fig. 1). 
Imprints were found as single or multiple 
specimens of the same or different species 
and cover a range of forms, including small
coccoliths (<3 μm; fig. S3I), larger nannoplankton
(figs. S3, J to O, and S8, A to D),
and collapsed coccolithophore exoskeletons 
(coccospheres) (figs. S5, A and B, and S8F). 
Nannoplankton imprints on organic matter 
have only occasionally been reported from 
the fossil record (13-15), presumably because 
of their cryptic mode of preservation and minute
size. Some studies have interpreted imprints as
the negative molds of coccoliths that were dissolved
during acid digestion of rock samples in the
laboratory (14). However, we recorded imprints on
unprocessed rock surfaces (fig. S6) and in samples
devoid of CaCO3, indicating that these fossils
occur naturally.
Reduced nannoplankton abundances during 
the T-OAE have been reported from multiple 
locations (data S1), but this signal is most ex- 
treme in the Cleveland Basin, Yorkshire (UK), 
where a nannoplankton disappearance event 
has been observed (16). Samples from Yorkshire, 
which we studied using traditional nanno- 
plankton body fossil methods, were either 
barren or yielded rare or very rare nanno- 
fossils (figs. S13 and S14). These observations 
essentially replicate previous findings of 
reduced species abundances or absences 
throughout the T-OAE (16) and seemingly
support the biocalcification crisis paradigm. 
However, imprints from the same samples 
challenge this view, revealing abundant and 
rich nannoplankton communities throughout 
the T-OAE interval, refuting the disappear- 
ance event hypothesis. Imprints are not just 
confined to the Cleveland Basin but have been 
discovered in a wide range of depositional 
settings in globally distributed lower Toarcian 
strata (e.g., Germany, Japan, and New Zealand; 
Figs. 1 and 2). These results indicate that observed
decreases in CaCO3 and nannofossil
abundance through the T-OAE are due to
CaCO3 dissolution after burial rather than
representing a primary crisis of the living
nannoplankton. The imprint fossils from 
Japan and New Zealand (Fig. 1) are the oldest 
coccoliths recorded from these countries, 
demonstrating that this approach is widely 
applicable and can expand nannofossil records, 
even in rocks where body fossils are absent and 

Fig. 1. Ghost nannofossils imprinted on organic matter. (A) Staurolithites
sp. imprint on Classopollis sp. (pollen) [NHMUK PM FM 2355 (2)].
(B) Crepidolithus impontus imprints on Cerebropollenites macroverrucosus
(pollen) [NHMUK PM FM 2355 (2)]. (C) Bussonius prinsii imprint on
prasinophyte alga [NHMUK PM FM 2355 (2)]. (D to K) Imprints on AOM.
(D) B. prinsii [NHMUK PM FM 2355 (2)]. (E) Axopodorhabdus atavus [NHMUK


have been subjected to thermal alteration [e.g., 
Japanese material studied here; see also (17)]. 
We further extended our study to test for imprints through two Cretaceous OAEs, the
early Aptian OAE1a (~120 Ma) and the Cenomanian-Turonian OAE2 (~94 Ma), for which there
are also claims of biocalcification crises (reviewed in data S2 and S3). The Cretaceous
OAEs are also associated with distinct episodes of volcanism and are characterized by
comparable suites of environmental, geological, and ecological changes [(12) and
references therein]. We found imprint fossils through both Cretaceous OAEs (Figs. 1 and
2 and figs. S11 and S12), demonstrating that ghost nannofossil preservation is not
limited to the T-OAE and that observed decreases in CaCO3 and nannoplankton abundances
during OAE1a and OAE2 (data S2 and S3) are probably linked to the secondary removal of
CaCO3 from the rock record. Imprints from the OAE2 Contessa section, Italy, are
particularly important because these were recorded during an inferred nannoplankton
biocalcification “blackout,” corresponding to the Bonarelli Level (a ~1-m-thick black
shale virtually devoid of CaCO3), which abruptly interrupts a limestone succession rich
in body fossils (fig. S14 and data S3). Our results overturn the blackout hypothesis and
indicate that the original CaCO3 has been lost through postburial dissolution, leaving a
misleading signal of declining carbonate production during OAE2.

The recurrence of imprints in OAE-related 
sediments demonstrates that these organic- 
rich intervals are especially prone to this type 


PM FM 2377 (1)]. (F) Calyculus serrai coccosphere [NHMUK PM FM 2377 (1)].
(G) Axopodorhabdus sp. [S043981]. (H) Lotharingius hauffii [S043957].
(I) Calyculus sp., Staurolithites sp., and others [S043993]. (J) Manivitella
pemmatoidea [S043946]. (K) Stoverius achylosus [S043941]. Blue images
are inverted virtual casts. Scale bars in overview images in (A) to (C)
are 10 μm; all other scale bars are 2 μm.


of nannofossil preservation; indeed, the OAE intervals record the richest imprint
assemblages (Fig. 2). Abundances of imprints and AOM are generally positively correlated
(fig. S15), which suggests that plentiful organic matter was an important requirement,
providing the necessary “plastic” substrate for imprinting. This also explains the
subsequent dissolution of CaCO3, because high amounts of organic matter can lead to
acidic pore waters during diagenesis (18). The formation of imprints also required
overburden pressure before the loss of body fossils, indicating that dissolution took
place after burial, and the absence of compressed imprints reveals that this occurred
after lithification.

Nannofossil abundances in sedimentary 
rocks are the product of a range of factors, 

Fig. 2. Ghost nannofossil, body nannofossil, and organic matter records through the T-OAE
(Japan and UK), OAE1a (Sweden), and OAE2 (Italy). Imprints/area denotes the number of imprints
recorded across a standard area of organic matter. Body nannofossil abundance categories: Abundant,
>10% of particles; Common, >1 to 10%; Frequent, 0.1 to 1%; Rare, <0.1%; Very Rare, <20 specimens
in total; Barren, no body nannofossils. See materials and methods for further details, including richness
and organic matter data collection methods. Note the different vertical scale for OAE2. Figure S14 is
an extended version of this figure that includes data from Germany and New Zealand. For raw data,
see data S4.
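
The body nannofossil abundance categories listed in the Fig. 2 caption can be read as a simple decision rule. The sketch below is only an illustration of that reading and is not the authors' counting protocol: the function name `abundance_category`, the use of raw specimen and particle counts as inputs, the assumption that the absolute-count rule for Very Rare takes precedence over the percentage thresholds, and the assignment of exactly 0.1% to Frequent are all hypothetical choices made here.

```python
def abundance_category(n_nannofossils: int, n_particles: int) -> str:
    """Assign a body nannofossil abundance category from raw counts.

    Thresholds follow the Fig. 2 caption. The order in which the rules
    are checked is an assumption of this sketch, not stated in the text.
    """
    if n_particles <= 0:
        raise ValueError("n_particles must be positive")
    if n_nannofossils < 0 or n_nannofossils > n_particles:
        raise ValueError("n_nannofossils must lie between 0 and n_particles")

    if n_nannofossils == 0:
        return "Barren"          # no body nannofossils
    if n_nannofossils < 20:
        # Assumption: fewer than 20 specimens in total overrides percentages.
        return "Very Rare"

    percent = 100.0 * n_nannofossils / n_particles
    if percent > 10:
        return "Abundant"        # >10% of particles
    if percent > 1:
        return "Common"          # >1 to 10%
    if percent >= 0.1:
        return "Frequent"        # 0.1 to 1% (boundary handling assumed)
    return "Rare"                # <0.1%


if __name__ == "__main__":
    # Hypothetical counts, for illustration only.
    for specimens, particles in [(0, 1000), (5, 1000), (150, 1000), (50, 100000)]:
        print(specimens, particles, abundance_category(specimens, particles))
```

Checking the absolute-count rule first keeps very small samples from being labeled Abundant on the strength of a handful of specimens; a different precedence would be equally easy to encode.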

including those that affect original populations (temperature, nutrients, water
chemistry), export pathways (grazing, ballasting), secondary abundances (exported
plankton versus siliciclastic dilution), and preservation (diagenesis). None of these
factors necessarily disrupts calcification in the living nannoplankton, and therefore
preserved abundance changes alone do not provide evidence of biocalcification crises. On
the contrary, our observations show that CaCO3 and nannoplankton body fossil abundances
can be severely modified or eradicated after deposition, indicating that these records
are unreliable proxies for OA or pelagic carbonate production. Independent geochemical
proxy evidence for OA at the T-OAE remains contentious (19, 20). Given the uncertainty
about rates of carbon injection that drove this and other OAEs, these rates may well
have been too slow to have induced prolonged or high-magnitude surface water OA (10, 21).
However, regardless of the severity or duration of OA during the T-OAE, or of other
proposed causes of a nannoplankton crisis such as changes in temperature, salinity,
nutrients, or anoxia (data S1), our records challenge the concept of a crisis. More
generally, these findings call for the reexamination of other inferred biocalcification
crises, and ghost nannofossils represent a tool with which to test such claims.

Our imprint record shows that nannoplankton communities were more resilient to
environmental changes during the T-OAE, including high CO2 and warming evident from
independent proxies [(12) and references therein], than traditional nannofossil and
CaCO3 records suggest. However, several previously observed species-specific changes,
such as declines in Schizosphaerella punctulata and nannoconids during the T-OAE and
OAE1a, respectively, may still represent primary responses of nannoplankton to
environmental change (data S1 and S2). At the community level, however, the imprint
record shows that nannoplankton flourished during the T-OAE, and their resilience is
supported by observations of increased speciation rates and an absence of elevated
extinctions across the T-OAE and OAE1a (data S1 and S2).

The abundance of prasinophyte algae in 
many T-OAE intervals has been interpreted as 
a rise to dominance at the expense of nanno- 
plankton (16), but the close association of im- 
prints and prasinophyte fossils observed here 
demonstrates that both groups coexisted or 
occurred in close succession (Fig. 1C). Near- 
monospecific assemblages of prasinophytes 
during the T-OAE likely represent persistent 
algal blooms (22); similarly, the monospecific 
concentrations of nannofossil imprints (Fig. 1F 
and figs. S3B and S7A) are pellets or aggregates 
that provide snapshots of high-dominance 
communities. Rather than being considered 
casualties of the T-OAE, our findings indicate 

Fig. 3. Schematic summary of the major changes in phytoplankton groups through the T-OAE, showing changes in phytoplankton export and the formation
of nannofossil imprints. Note that the oxygen minimum zone (OMZ) expands during the OAE and that, after the OAE, the acidic pore waters within subsurface
sediments lead to the dissolution of nannoplankton body fossils and the formation of imprints.


that nannoplankton continued to draw down CO2 and sequester carbon in seafloor
sediments, which in the long term likely expedited the termination of the event.
However, high production, blooms, and potential toxicity (23) suggest that in the short
term nannoplankton, like prasinophytes, fueled anoxia through eutrophication and
increased accumulation of organic matter at the seafloor, enhanced by coccolith
ballasting (24) (Fig. 3).

Our records of ghost nannofossils, discovered through unconventional methods, indicate
no evidence of biocalcification crises during the studied Mesozoic OAEs, at least for
plankton that form calcite, the more stable CaCO3 polymorph compared to aragonite.
Instead, these findings show how diagenesis can completely reshape the geological
archive and highlight that a literal reading of the fossil record can mislead
interpretations. Given that atmospheric CO2 concentrations are currently rising at
previously unseen rates, the use of individual OAEs as past analogs of current change
may be premature because carbon-input rates (and therefore the duration and intensity of
surface water OA) remain uncertain for these events (10, 21). Nevertheless, our imprint
record demonstrates the resilience of nannoplankton communities during multiple past
global warming events and shows that plankton proliferation can accelerate the
development of OAEs. Our findings also indicate that the conditions that prevailed
during OAEs may become more prevalent (25, 26), with plankton blooms and hypoxic dead
zones becoming widespread across globally warming oceans.

Recognition of pathogen-derived sphingolipids in Arabidopsis

H. Kato, K. Nemoto, M. Shimizu, A. Abe, S. Asai, N. Ishihama, S. Matsuoka, T. Daimon,
Y. Takano, R. Terauchi


In plants, many invading microbial pathogens are recognized by cell-surface pattern recognition receptors,
which induce defense responses. Here, we show that the ceramide Phytophthora infestans-ceramide D
(Pi-Cer D) from the plant pathogenic oomycete P. infestans triggers defense responses in Arabidopsis. Pi-Cer D
is cleaved by an Arabidopsis apoplastic ceramidase, NEUTRAL CERAMIDASE 2 (NCER2), and the resulting
9-methyl-branched sphingoid base is recognized by a plasma membrane lectin receptor-like kinase,
RESISTANT TO DFPM-INHIBITION OF ABSCISIC ACID SIGNALING 2 (RDA2). 9-Methyl-branched sphingoid
is specific to microbes and induces plant immune responses by physically interacting with RDA2. Loss of
RDA2 or NCER2 function compromised Arabidopsis resistance against an oomycete pathogen. Thus, we
elucidated the recognition mechanisms of pathogen-derived lipid molecules in plants.


Plants defend themselves against a multitude of microbial pathogens by sensing pathogen
invasion through cell-surface pattern recognition receptors (PRRs) that recognize
microbe- or pathogen-associated molecular patterns (MAMPs, PAMPs) or damage-associated
molecular patterns (DAMPs) of host-derived molecules that result from damage caused by
pathogen attack. This recognition then activates immune signaling (1-3).

Fig. 1. Arabidopsis RDA2 and NCER2 are required for recognition of Pi-Cer D and
resistance against H. arabidopsidis. (A) Chemical structure of Pi-Cer D.
(B) Bioluminescence response of Arabidopsis pWRKY33-LUC reporter (WT, wild type),
rda2-4, and ncer2-2 after Pi-Cer D (0.17 μM) treatment (means ± SD). For additional
data, see fig. S3. (C) Bioluminescence of WT and mutant seedlings after treatment with
Pi-Cer D, flg22, elf18, or chitin. Relative peak bioluminescence values are shown as a
percentage of the WT (means ± SE). For additional data, see fig. S4. (D and E) Gene and
protein structures of RDA2 (D) and NCER2 (E). Gene structure (top), showing exons in
boxes and introns as lines between the boxes. Protein structure (bottom), showing the
different domains. The positions of the mutations in different alleles (closed triangle)
and T-DNA insertion sites (opened triangle) are

The Arabidopsis genome contains genes encoding ~580 PRRs, including ~410 receptor-like
kinases (RLKs) and ~170 receptor-like proteins (RLPs) that lack the kinase domain (4).
However, molecular interactions between MAMPs or PAMPs and PRRs have only been
demonstrated in a limited number of cases, with the majority involving peptides,
proteins, and carbohydrates. Lipids derived from microbes are also recognized by plant
PRRs. The Arabidopsis PRR LIPOOLIGOSACCHARIDE-SPECIFIC REDUCED ELICITATION recognizes
medium-chain 3-hydroxy fatty acids of bacterial pathogens (5), but whether other PRRs
recognize pathogen lipids remains unknown.

Ceramides belong to a class of sphingolipids consisting of a sphingoid base and a fatty
acid and are present at high concentrations in eukaryotic cell membranes. Ceramide and
its metabolites are also involved in intracellular signal transduction in animal cells
and plants (6, 7). Phytophthora infestans, an oomycete pathogen, is the causal agent of
the potato late blight that devastated potato crops and caused famine in the 19th
century. It is still a threat to potato production worldwide. A

indicated. B-lectin, an extracellular b-mannose-binding lectin domain; SLG, S-locus glycoprotein family domain. (F) Complementation of rda2 and ncer2 mutants with
wild-type alleles. Bioluminescence (means ± SE) of WT, rda2-4, ncer2-2, and complemented lines after Pi-Cer D (0.17 μM) treatment is shown. For additional data, see fig. S7.
(G) Growth of H. arabidopsidis Waco9 on Arabidopsis WT, rda2-4, ncer2-2, and complementation lines. Conidiospores were harvested and counted 5 days after inoculation
(n = 5). *p < 0.05 in two-tailed t tests comparing the corresponding values from WT. Experiments were performed three times with similar results.
ceramide-related compound, P. infestans ceramide 
D (Pi-Cer D, Fig. 1A), induces immune 
responses in potato plants (8). Pi-Cer D also 
induces defense responses in Arabidopsis (8); 
therefore, we aimed to identify the Arabidopsis 
components involved in the perception of Pi- 
Cer D. For this, we used Lumi-Map, a platform 
consisting of a luciferase (LUC)-based mutant 
screen and gene identification (fig. S1) (9). Because 
the Arabidopsis WRKY33 gene is induced 
by PAMPs, including flg22, a peptide derived 
from bacterial flagellin, and is required for 
resistance against pathogens (10, 11), we tested 
whether Pi-Cer D induced the expression of 
a LUC transgene driven by the WRKY33 promoter 
(pWRKY33-LUC) in Arabidopsis, which 
resulted in a transient induction of bioluminescence 
(fig. S2). We then screened 10,000 M2 
seedlings generated by ethyl methanesulfonate 
mutagenesis of the pWRKY33-LUC reporter 
line (W33-1B) for mutants that showed a re- 
duction in bioluminescence after Pi-Cer D treat- 
ment [named low (L) mutants]. We isolated 
nine mutants insensitive to Pi-Cer D (L-09, 
L-12, L-16, L-19, L-31, L-46, L-55, L-66, and 
L-74) and two mutants with a low response to 
Pi-Cer D (L-53 and L-107) (Fig. 1B, fig. S3, and 
table S1). These mutants showed normal bio- 
luminescence responses to other PAMPs, such 
as flg22, elf18 (derived from bacterial elonga- 
tion factor Tu), and chitin, a component of 
fungal cell walls, indicating that they carried 
lesions affecting the signaling pathway that is 
required for the specific response to Pi-Cer D 
(Fig. 1C and fig. S4). 
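
The "percentage of the WT (means ± SE)" summary used for the reporter assays can be sketched as follows. This is only an illustration of the arithmetic: the function name and the peak values are hypothetical and are not data from the study, and the normalization to the WT mean is an assumption about how such a percentage would typically be computed.

```python
import math
import statistics


def percent_of_wt(mutant_peaks, wt_peaks):
    """Return (mean, SE) of mutant peak values as a percentage of the WT mean."""
    wt_mean = statistics.mean(wt_peaks)
    percents = [100.0 * value / wt_mean for value in mutant_peaks]
    mean = statistics.mean(percents)
    # Standard error of the mean: sample standard deviation / sqrt(n)
    se = statistics.stdev(percents) / math.sqrt(len(percents))
    return mean, se


# Hypothetical peak bioluminescence values (arbitrary units), for illustration only
wt_peaks = [80.0, 100.0, 120.0]
mutant_peaks = [10.0, 20.0, 30.0]

mean_pct, se_pct = percent_of_wt(mutant_peaks, wt_peaks)
print(f"{mean_pct:.1f} ± {se_pct:.1f} % of WT")
```

A mutant classed as insensitive would give a value near zero for Pi-Cer D while staying near 100% for flg22, elf18, and chitin.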

To identify the gene(s) altered in these mu- 
tants, we performed MutMap analysis (12). 
All nine Pi-Cer D-insensitive mutants showed 
single-nucleotide polymorphism (SNP)-index 
peaks on chromosome 1 and contained SNPs 
within the gene At1g11330 encoding a lectin 
receptor-like kinase RESISTANT TO DFPM- 
INHIBITION OF ABSCISIC ACID SIGNALING 2 
(RDA2) (Fig. 1D, fig. S5, and table S2), a mutant 
of which (rda2) is insensitive to a small synthetic 
molecule, [5-(3,4-dichlorophenyl)furan-2-yl]-piperidine-1-ylmethanethione 
(DFPM), 
and incapable of mounting DFPM-mediated 
immune signaling and inhibition of abscisic 
acid signaling (13). RDA2 belongs to the S-locus 
receptor kinase (SRK)-like kinase family, and 
its homologs are found in potato (fig. S6). 
Thus, we tentatively named the nine Pi-Cer D- 
insensitive mutants rda2-4 through rda2-10. 
The two Pi-Cer D low-response mutants showed 
SNP-index peaks on chromosome 2 and car- 
ried mutations in the gene At2g38010, which 
encodes neutral ceramidase 2 (NCER2; Fig. 1E, 
fig. S5, and table S2) (14). Potato has a homo- 
log of NCER2 (fig. S6). We then tentatively 
named the L-53 and L-107 mutants ncer2-2 
and ncer2-3, respectively. Complementation of 
the rda2 and ncer2 mutant lines by the respec- 
tive wild-type alleles restored bioluminescence 




[Fig. 2 panel labels: bioluminescence (× 10^4 cps/plant); AWF from A.t.; immunoblots of total protein and AWF (HA-NCER2, IB: α-HA; actin, IB: α-actin); (D) sphingoid base traces versus time (min) for the (4E,8E,10E)-9-methyl-4,8,10-sphingatrienine standard, Pi-Cer D, WT + Pi-Cer D, rda2-4 + Pi-Cer D, and ncer2-2 + Pi-Cer D; medium (genotype + treatment).] 


Fig. 2. NCER2 cleaves Pi-Cer D to produce a sphingoid base in the apoplast. (A) Pi-Cer D is cleaved 
by the Arabidopsis ceramidase NCER2. HA-NCER2 was expressed and purified from N. benthamiana. The lipid 
fraction containing metabolites derived from Pi-Cer D incubated with HA-NCER2 was recovered and applied 
to ncer2-2 seedlings, and their bioluminescence was measured. Heat-inactivated HA-NCER2 and mouse ceramidase 
(mouse CD) were used as controls. Peak bioluminescence is shown (means ± SE). For additional data, 
see fig. S12. (B) Pi-Cer D-cleavage activity of Arabidopsis (A.t.) apoplast wash fluid (AWF). AWFs were isolated 
from the pWRKY33-LUC reporter line (WT), ncer2-2, and ncer2-2 HA-NCER2 lines. Peak bioluminescence is 
shown (means ± SE). For additional data, see fig. S13. (C) Immunoblot (IB) analysis of total protein and AWF. 
Actin-specific antibody was used to detect cytosolic protein. (D) Detection of sphingoid base in the growth 
medium of plants treated with Pi-Cer D. (Left) Spectra obtained by liquid chromatography-mass spectrometry 
of sphingoid base recovered from growth media. (4E,8E,10E)-9-methyl-4,8,10-sphingatrienine and Pi-Cer D 
were used as standards. Values shown are relative ones normalized to the highest peaks. Background peaks 
are denoted by an asterisk. (Right) pWRKY33-LUC bioluminescence induction activity of the recovered lipid on 
ncer2-2. Peak bioluminescence values are shown (means ± SE). Additional data are available in fig. S18. 
Experiments were performed two [(A) and (D)] or three times [(B) and (C)]. 


induction after Pi-Cer D treatment, confirm- 
ing that RDA2 and NCER2 are the responsi- 
ble genes for the given phenotypes (Fig. 1F 
and fig. S7). Furthermore, transfer DNA 
(T-DNA) insertion mutant lines for RDA2 
and NCER2 showed either no or reduced 
induction of WRKY33 gene expression after 
Pi-Cer D treatment (figs. S8 and S9). Collec- 
tively, these results indicate that RDA2 and 
NCER2 are required for Pi-Cer D recogni- 
tion in Arabidopsis. We then asked whether 
RDA2 and NCER2 contribute to Arabidopsis 
immunity against the oomycete pathogen 
Hyaloperonospora arabidopsidis, which, like 
P. infestans (a nonpathogen of Arabidopsis), 
also produces Pi-Cer D (fig. S10). Both rda2 
and ncer2 mutants showed increased susceptibility 
to H. arabidopsidis, and complementation 
by the respective wild-type alleles 
restored resistance (Fig. 1G), indicating that 
RDA2 and NCER2 are required for resistance 
to this pathogen (fig. S11). 
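
The SNP-index used in MutMap-style mapping can be illustrated with a minimal sketch. It assumes the usual definition (the fraction of sequencing reads at a position that carry the mutant allele in the bulked DNA). The function names, the threshold, and the read counts are hypothetical and are not taken from this study.

```python
def snp_index(alt_reads: int, total_reads: int) -> float:
    """Fraction of reads at one position that carry the mutant allele."""
    if total_reads <= 0:
        raise ValueError("total_reads must be positive")
    if not 0 <= alt_reads <= total_reads:
        raise ValueError("alt_reads must lie between 0 and total_reads")
    return alt_reads / total_reads


def candidate_positions(positions, threshold=0.9):
    """Keep positions whose SNP-index is at or above an (assumed) threshold."""
    return [
        (chrom, pos)
        for chrom, pos, alt, total in positions
        if snp_index(alt, total) >= threshold
    ]


# Hypothetical read counts: (chromosome, position, mutant reads, total reads)
positions = [
    ("chr1", 1_000, 38, 40),   # index 0.95: linked to the causal mutation
    ("chr1", 50_000, 21, 42),  # index 0.50: unlinked, segregating
    ("chr2", 7_500, 10, 40),   # index 0.25: unlinked
]

print(candidate_positions(positions))
```

Positions unlinked to the phenotype are expected to hover near 0.5 in a bulk of mutant-phenotype progeny, whereas the causal region approaches 1, which is why a peak marks the candidate interval.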

We hypothesized that (i) Pi-Cer D is cleaved 
by NCER2 into a mature ligand product in the 
apoplastic space, and (ii) the ligand is recog- 
nized via plasma membrane-localized RDA2. 
To test the first hypothesis, we investigated 
whether the ncer2 mutant phenotype was 
rescued by the product generated by NCER2 
ceramidase treatment of Pi-Cer D. The lipid 
fraction containing Pi-Cer D and NCER2 pro- 
duced in Nicotiana benthamiana, as well as that 
Fig. 3. RDA2 recognizes 9-methyl sphingoid base and induces defense responses. (A) Structures of 
(4E,8E)-9-methyl-4,8-sphingadienine (9Me-Spd), (4E,8E)-4,8-sphingadienine (Spd), and sphingosine (Sph). 
(B and C) Peak bioluminescence values for Arabidopsis pWRKY33-LUC reporter seedlings (WT) treated with 
(B) different concentrations of 9Me-Spd or (C) 0.5 µM sphingoid bases (means ± SE). For additional data, 
see figs. S19 and S21. (D) Expression of defense genes in Arabidopsis seedlings 3 hours after elicitation 
with sphingoid bases (0.5 µM) relative to WT (DMSO, dimethyl sulfoxide). Individual data (symbols) and 
means (bars) are shown (n = 3); *p < 0.01 (two-tailed t test). (E) Mitogen-activated protein kinase (MAPK) 
activation after treatment with sphingoid bases (0.5 µM). (F) ROS accumulation in Arabidopsis leaves. 
(Top) Leaf disks from WT treated with 30 µM sphingoid bases (n = 12). (Bottom) Leaf disks from WT 
and rda2-4 treated with 30 µM 9Me-Spd (n = 10). Relative light units (RLU) are shown (means ± SD). (G) SPR 
binding kinetics analysis of the wild-type ectoRDA2 with 9Me-Spd (top) or Pi-Cer D (bottom). RU, resonance 
unit. For additional data, see figs. S25 to S27. Experiments were performed two [(B), (D), (F), and (G)] 
or three times [(C) and (E)] with similar results. 


containing Pi-Cer D and a mouse ceramidase, 
induced pWRKY33-LUC bioluminescence in 
the ncer2-2 mutant (L-53) (Fig. 2A, fig. S12, 
and table S3). We observed no bioluminescence 
when heat-inactivated NCER2 was 
used. These results indicate that Pi-Cer D was 
cleaved by NCER2-encoded ceramidase, and 
the resulting compound was recognized by 
RDA2. We detected the hemagglutinin (HA)- 
NCER2 protein and Pi-Cer D-cleavage activity 
in the apoplast wash fluid (AWF) of ncer2-2 
HA-NCER2 complementation lines (Fig. 2, B 
and C, and fig. S13). Mass spectrometry analysis 
of compounds in the lipid fraction prepared 


from a mixture of Pi-Cer D and ceramidase detected 
a sphingoid base, (4E,8E,10E)-9-methyl-4,8,10-sphingatrienine 
(9Me,4E,8E,10E-d19:3, 
hereafter 9Me-Spt), which might be the ligand 
for RDA2 (figs. S14 to S17). We detected 
9Me-Spt in the growth medium of the wild- 
type plant and the rda2 mutant, but not in that 
of ncer2 mutant of Arabidopsis plants treated 
with Pi-Cer D (Fig. 2D and fig. S18). Detection 
of 9Me-Spt in the medium correlated with its 
ability to induce WRKY33 expression, suggest- 
ing that apoplastic NCER2 cleaves Pi-Cer D to 
9Me-Spt, which is recognized by RDA2. 
9Me-Spt contains a branching methyl group 
at the ninth carbon position. This 9-methyl- 
branching structure is present in sphingoid 
bases of oomycetes, fungi, and marine invertebrates 
but has not been reported in plants 
or mammals (15-17). We thus hypothesized 
that the 9-methyl-branching structure of the 
sphingoid base plays a role in distinguishing 
between “self” and “nonself” in plants. There- 
fore, we investigated the ability of various 
sphingoid bases to induce a defense response. 
Because 9Me-Spt was difficult to obtain, 
we used (4E,8E)-9-methyl-4,8-sphingadienine 
(9Me,4E,8E-d19:2, hereafter 9Me-Spd) to evaluate 
the 9-methyl structure (Fig. 3A and fig. 
S19). The rda2 mutants were almost completely 
insensitive to 9Me-Spd, indicating that 
9Me-Spd is specifically recognized by RDA2 
(fig. S19). Among the sphingoid bases that we 
tested, 9Me-Spd showed the strongest RDA2- 
dependent elicitor activity (Fig. 3, B and C, and 
figs. S19 and S20). In addition, (4E,8E)-4,8-sphingadienine 
(4E,8E-d18:2, Spd) and sphingosine 
(4E-d18:1, Sph), neither of which contains 
9-methyl branching, also showed elicitor activity, 
although this was significantly weaker 
than that of 9Me-Spd (Fig. 3C and figs. S20 
and S21). To identify the structural correlates 
of RDA2-dependent sensing of the sphingoid 
base, we tested sphingosine derivatives with 
different lengths of long-chain bases. Among 
these derivatives, Sph (4E-d18:1) induced the 
highest bioluminescence in the pWRKY33-LUC 
reporter line, followed by 4E-d16:1, 4E-d14:1, 
and 4E-d12:1 (fig. S22). This indicates that 
efficient sensing by RDA2 requires a long- 
chain base structure that includes 18 car- 
bon atoms. We also tested phytosphingosine 
(4-hydroxysphinganine, 4-t18:0, PHS) and 
found that it did not elicit bioluminescence 
in the pWRKY33-LUC reporter line. This indicates 
that the 4E double-bond structure in Sph 
is crucial for its sensing by RDA2 (fig. S23). 
To further investigate the downstream events 
following RDA2-mediated sensing, we tested 
the ability of 9Me-Spd and its derivatives to 
activate Arabidopsis immune responses. The 
9Me-Spd activated RDA2-dependent biolumi- 
nescence induction, transcript accumulation 
of defense-related genes (FRK1, At1g51890), 
phosphorylation of mitogen-activated protein 
kinases, and the production of reactive oxygen 
species (ROS) more strongly than did Spd and 
Sph (Fig. 3, C to F, and fig. S24). To investigate 
the physical interaction between RDA2 and 
sphingoid bases, we performed a thermal shift 
assay and a surface plasmon resonance (SPR) 
assay using the extracellular domain of RDA2 
(ectoRDA2) produced in silkworm (fig. S25). 
In both assays, the wild-type ectoRDA2 bound 
9Me-Spd, Spd, and Sph at a similar strength 
(dissociation constant Kd of 0.31 µM for 9Me-Spd, 
0.43 µM for Spd, and 0.49 µM for Sph), 
whereas the mutant ectoRDA2 did not (Fig. 
3G, figs. S26 and S27, and tables S4 and 
S5). Pi-Cer D was not bound by ectoRDA2. A 
protein-lipid overlay (PLO) assay using a pu- 
rified RDA2 protein showed similar results 
(fig. S28). By contrast, PLO with a membrane 
fraction containing HA-tagged RDA2 showed 
higher interaction signals between 9Me-Spd 
and RDA2 than that of Sph and RDA2 (figs. 
S29 and S30), suggesting the possible presence 
of another component that determines the 
recognition specificity of sphingoid bases by 
RDA2. Collectively, these results suggest that 
RDA2 is the receptor for sphingoid bases, 
including 9-methyl sphingoid base, which is 
derived from Pi-Cer D. 
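
To see what the similar dissociation constants imply, the following sketch computes the fraction of receptor occupied at the 0.5 µM concentration used in the reporter assays. It assumes a simple one-site equilibrium binding model with free ligand equal to the added concentration; that model and the function name are illustrative assumptions, not part of the reported analysis.

```python
def fraction_bound(ligand_um: float, kd_um: float) -> float:
    """One-site equilibrium occupancy: [L] / ([L] + Kd), both in µM."""
    if ligand_um < 0 or kd_um <= 0:
        raise ValueError("ligand_um must be >= 0 and kd_um must be > 0")
    return ligand_um / (ligand_um + kd_um)


# Dissociation constants reported in the text (µM)
KD_UM = {"9Me-Spd": 0.31, "Spd": 0.43, "Sph": 0.49}

# Ligand concentration used in the reporter assays (µM)
LIGAND_UM = 0.5

occupancy = {name: fraction_bound(LIGAND_UM, kd) for name, kd in KD_UM.items()}

for name, value in occupancy.items():
    print(f"{name}: {value:.2f}")
```

Under this assumption the three ligands would occupy roughly half to two-thirds of the isolated ectodomain, a modest spread compared with the difference in elicitor activity, which fits the suggestion that another component contributes to recognition specificity.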

Sphingolipids such as ceramide are com- 
ponents of eukaryotic membranes (16-18). 
Our findings revealed that a plant apoplastic 
ceramidase cleaves an oomycete-derived cer- 
amide, and the resulting sphingoid base is 
recognized by a lectin receptor-like kinase 
(fig. S31). The 9-methyl-branching structure of 
sphingoid bases is prevalent in oomycetes 
and fungi, and plants perceive the abundance 
of nonself sphingolipid to mount defense. 
RDA2 is required for immune signaling and 
inhibition of abscisic acid signaling by a small 
synthetic molecule, DFPM, as identified by a 
chemical genetic screen (13). We hypothesize 
that DFPM or its metabolized product func- 
tions as a mimic of sphingoid base, but further 
study is required to clarify this. Our study pro- 
vides a basis on which to engineer RDA2 to 
detect various pathogen-specific lipids and 
to enable plants to mount a stronger defense 
against devastating crop pathogens such as 
P. infestans. 
SUPERCONDUCTIVITY 


Enhanced charge density wave coherence in a 
light-quenched, high-temperature superconductor 


S. Wandel, F. Boschini, E. H. da Silva Neto, L. Shen, M. X. Na, S. Zohar, Y. Wang, 
S. B. Welch, M. H. Seaberg, J. D. Koralek, G. L. Dakovski, W. Hettel, M.-F. Lin, S. P. Moeller, 
W. F. Schlotter, A. H. Reid, M. P. Minitti, T. Boyle, F. He, R. Sutarto, R. Liang, D. Bonn, 
W. Hardy, R. A. Kaindl, D. G. Hawthorn, J.-S. Lee, A. F. Kemper, A. Damascelli, 
C. Giannetti, J. J. Turner, G. Coslovich 


Superconductivity and charge density waves (CDWs) are competitive, yet coexisting, orders in 
cuprate superconductors. To understand their microscopic interdependence, a probe capable of 
discerning their interaction on its natural length and time scale is necessary. We use ultrafast 
resonant soft x-ray scattering to track the transient evolution of CDW correlations in YBa2Cu3O6+x 
after the quench of superconductivity by an infrared laser pulse. We observe a nonthermal 
response of the CDW order characterized by a near doubling of the correlation length within 
≈1 picosecond of the superconducting quench. Our results are consistent with a model in which 
the interaction between superconductivity and CDWs manifests inhomogeneously through disruption 
of spatial coherence, with superconductivity playing the dominant role in stabilizing CDW topological 
defects, such as discommensurations. 


Unconventional superconductivity often 
emerges in proximity to other low-energy 
broken-symmetry states, including antiferromagnetism, 
spin order, and charge 
density waves (CDWs). These orders exist 
on similar energy scales and often become 
“intertwined” (1, 2), manifesting a complex 
interplay arising from strong correlations. 
The presence of CDW order, a periodic modulation 
of the electronic density, is ubiquitous 
in cuprates (3-14) and has been detected by 
x-ray scattering (7-11, 13-15), scanning tunneling 
spectroscopy (STS) (4-6, 10, 12, 16), and 
nuclear magnetic resonance (NMR) (17, 18). 
The periodicity of such CDWs is generally in- 
commensurate; however, STS and NMR exper- 
iments have revealed locally commensurate 
CDW patterns (4, 5, 12, 17, 18). CDW order can 


coexist with superconductivity below the superconducting 
critical temperature (T_c), and its 
temperature dependence suggests a tendency 
to compete with superconductivity (8, 10). In 
YBa2Cu3O6+x (YBCO), high magnetic fields 
suppress superconductivity and lead to the 
enhancement of CDW ordering (15, 17, 19). 
More recently, CDW enhancement has been 
achieved by means of applying uniaxial pres- 
sure (20), yet dynamical information about 
the interaction between order parameters has 
not been accessed so far. 

An emerging approach to studying inter- 
twined orders is to measure their response 
to ultrafast photoexcitation in the time do- 
main. Ultrafast laser pulses enable selective 
quenching of orders, thereby inducing an en- 
hancement of the competing orders (21-24). 
The use of free-electron lasers enables ultrafast 
x-ray scattering experiments in charge- 
ordered systems. However, such studies have 
thus far centered on the photoinduced melt- 
ing dynamics of charge order (25-29). In this 
[Fig. 1 panel labels: YBa2Cu3O6.67 sample; x-ray probe, 931.5 eV, 100 fs; optical laser pump, 1.5 eV, 100 fs; detector; Q_CDW = (0.31, 0, 1.45); I (arb. units).] 


Fig. 1. Experimental conditions and time-domain results. (A) Diagram of 
the experiment. Optical laser pulses perturb the sample, and x-ray pulses 
probe the CDW peak. The momentum space is explored by rocking the sample. 
Both a two-dimensional area detector and a point detector (gray rectangle) 
were used to collect the scattered light. Two representative H scans, Q = (H, 0, 
1.45), are shown in the inset, after subtraction of the fluorescence background, 
at temperatures of 12 and 65 K (36). (B) Schematic phase diagram of 
YBa2Cu3O6+x showing the two temperatures of interest, within (20 K) and just above 
(65 K) the superconducting (SC) dome for x equal to 0.67 (hole doping p = 0.12). 
(C) CDW signal at 65 K as a function of the delay time, t, between the pump and 
probe pulses, with a laser pump fluence of 0.16 mJ/cm². The solid line is a fit 
to exponential relaxation dynamics. (D) Fluence dependence of the normalized 
maximum transient optical reflectivity, ΔR/R (blue circles), and x-ray scattering 
work, we focus instead on unveiling the interaction 
between superconductivity and CDWs 
in the time domain. We do so by performing 
time-resolved resonant soft x-ray scattering on 
YBCO, tracking the CDW response to a laser- 
driven ultrafast quench of superconductivity. 
To probe electronic modulations connected 
to the CDW, we used an incident photon energy 
of 931.5 eV, resonant with the Cu L3-edge, 
and scanned the momentum transfer around 
wave vector Q_CDW = (0.31, 0, 1.45) (Fig. 1A). 
The sample was photoexcited by 100-fs infrared 
laser pulses, which were variably delayed in 
time with respect to the 100-fs x-ray pulses. 
The doping of the sample (x = 0.67) was 
selected to be at the apex of the CDW dome 
(Fig. 1B) in order to maximize the competi- 
tion with superconductivity. The experiment 
was performed in the superconducting phase 
(<20 K) and, for comparison, at T_c (65 K), 
where the CDW scattering intensity is maxi- 
mal [Fig. 1A and (8)]. It has been shown that 
800-nm laser pulses strongly perturb super- 
conductivity in cuprates, inducing a nonther- 
mal phase transition within a few hundreds 
of femtoseconds (23, 30-33). Here, we kept the 
fluence of the pump pulses near the minimum 
needed to quench the superconducting phase 
(32), which is 0.05 mJ/cm², such that parasitic 
local heating is minimized. 

The photoinduced response of the CDW 
state in the time domain is tracked by probing 
the peak intensity integrated around Q_CDW 
at varying time delays with respect to the 
pump pulse. At T_c, the pump induces a prompt 
resolution-limited decrease of the CDW peak 
intensity (Fig. 1C), followed by a ~3-ps recovery. 
This dynamic behavior is similar to the 
well-known photoinduced melting in other 
charge-ordered systems (25-28, 34, 35). However, 
below T_c (at 20 K) the pump yields the 
opposite effect, inducing an enhancement of 
the CDW peak signal with notably different 
dynamics (Fig. 1E). Whereas at 65 K a relatively 
high pump fluence (0.16 mJ/cm²) is 
necessary to observe the prompt CDW melting 
with good signal-to-noise ratio, at 20 K, a low 
pump fluence (0.05 mJ/cm²) produces a large 
enhancement of the CDW signal, with a rise 
time of 1.5 ps and a considerably longer decay 
time (7 ps). This peculiar response, appearing 
only below T_c, is clearly linked to the presence 
of the superconducting condensate. We 
reiterate that the low-fluence pump allows 
for the transient quenching of superconduc- 
tivity while minimizing heating effects and 
photoinduced enhancement (purple circles) at 20 K (36). The blue line is a fit with 
an exponential function that represents saturation of the superconducting response. 
area highlights fluences below the saturation point. (E) Optical (blue 
circles) and x-ray scattering (purple circles) signals measuring the superconducting 
and CDW dynamics, respectively, at 20 K with a laser pumping 
fluence of 0.05 mJ/cm². The solid purple line indicates a fit of the x-ray 
scattering dynamics, where the largest component is a positive enhancement 
of the CDW signal (36). The gray dashed line represents the estimate of quasi-thermal 
heating effects (36). To match the fluence value, the optical data 
are obtained by averaging the two closest curves from the full fluence-dependence 
characterization (fig. S4). In (C) and (E), the black vertical dotted 
lines mark time zero, and the vertical dashed lines mark the delays considered 
in Fig. 2, at which photoinduced variations are maximal. 
Fig. 2. X-ray scattering profiles of the CDW peak before and after 
photoexcitation. (A and B) H-scan plots of the CDW scattering signal, after 
background subtraction, in equilibrium [yellow symbols in (A) and gray 
symbols in (B)] and after photoexcitation [red symbols in (A) and purple 
symbols in (B)] at 65 K (A) and 12 K (B). The pump fluence was 0.05 mJ/cm². 
Vertical dashed lines mark the wave vector of the peak at negative delays. 
Similar results to (B) were obtained using a different detector and at a 
temperature of 20 K (fig. S2) (36). The inset in (A) shows the cut of 
peak in the (H, K) reciprocal plane. Error bars are 1 SD. (C and D) Co 


keeping the temperature below T_c. The maxi- 
mum rise of the lattice temperature is esti- 
mated to be about 20 K, and the dashed line 
in Fig. 1E displays the estimated heat-induced 
changes in reflectivity (36). In addition, the 
CDW enhancement signal saturates at this 
fluence (Fig. 1D), and for higher fluences 
(>0.1 mJ/cm²), the CDW response switches to 
a fast melting and a slow enhancement, with a 
fluence-dependent cross-over time (fig. S3) 
(36). Notably, this enables full optical con- 
trol over the CDW phase, switching between 
enhancement and suppression by tuning the 
laser power. 

To better understand these results and draw 
a direct comparison with the dynamics of the 
superconducting order, we performed com- 
plementary transient optical reflectivity mea- 
surements at 800 nm. At this wavelength, 
we expect to observe the well-known spectral 
weight transfer from the low-energy super- 
conducting gap to interband transitions (23). 
The transient optical reflectivity signal is 
dominated by the superconducting response, 
as confirmed by its temperature and fluence 
dependence (fig. S4) and by comparison to 
transient mid-infrared spectroscopy resonant 
to the superconducting gap on an analogous 
sample (fig. S5) (36, 37). The optical reflec- 
tivity dynamics show typical features that 
result from the quench of the superconducting 
state: the signal saturates at a critical 
fluence, Φ_c (23, 30-32) (Fig. 1D), above which 
a flat-top-shaped dynamics develops (fig. S4) 
(33, 36). When we normalize the transient 
reflectivity by its saturated value, we note 
that at the fluence of 0.05 mJ/cm², superconductivity 
is almost completely quenched 
(>90%) in the probing volume. Superconduc- 
tivity recovers on a similar time scale as the 
CDWs (Fig. 1E); however, their early dy- 
namics are clearly distinct and cannot be 
described by a thermodynamic variable, such 
as an effective temperature. After subtracting 
a small residual melting signal, we note that 
the CDW enhancement happens on a time 
scale of ~1 ps, reaching a maximum around 
≈2 ps (36), whereas the quench of the superconductivity 
is complete within 300 fs. Thus, 
within the first ~2 ps after photoexcitation, we 
observe the nonthermal dynamical reaction 
of CDWs to the quench of superconductivity. 
To obtain a snapshot of the CDW spatial 
correlations after photoexcitation, we scanned 
the H direction, Q = (H, 0, 1.45), in reciprocal 
space (Fig. 2) at the delay times where photoinduced 
variations are maximal (~0.5 ps at T_c 
and ~2 ps below T_c). At 65 K, the CDW peak 
signal suppression is given by a decrease of 
integrated intensity (16%), as well as a de- 
crease of correlation length (25%). This re- 
sponse is comparable, for example, to the 
length ξ_a along the a axis (C) and normalized integrated intensity I_int (D) 
as a function of temperature measured at equilibrium (black circles) 
and under photoexcitation at negative delays (blue triangles) and around 
a ≈2-ps delay time (red triangles). For the data at 65 K, the 2-ps values 
were extrapolated from 0.5 ps, assuming the decay dynamics reported 
in Fig. 1C. The red shading highlights the temperature trend after photoexcitation. 
Error bars are evaluated as 1 SD for the fit coefficient. The 
error bars on the temperature value of the pumped data reflect possible 


ultrafast CDW melting in tritellurides (38), 
with the variation in correlation length com- 
monly attributed to the formation of topo- 
logical defects (25, 38). When the system is 
cooled to well below T_c (12 K; Fig. 2B), the H 
scan reveals a considerable CDW peak en- 
hancement of +120% after photoexcitation. 
The CDW peak intensity increase is mostly 
caused by the dramatic peak narrowing, in- 
dicating a ~90% increase of the correlation 
length, from 36 ± 3 Å to 69 ± 5 Å, that is, 
almost a doubling in size of coherent CDW 
domains. Concurrently, the integrated inten- 
sity increases, but by a smaller amount, between 
10 and 20%. We also note a ~0.003-reciprocal 
lattice units (r.l.u.) momentum shift at 12 K. 
This shift is not present in the melting case 
(65 K), suggesting that it is not related to 
heating or to CDW sliding effects that are 
directly induced by light (35). 
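
A rough consistency check of these numbers can be sketched as follows. It assumes a one-dimensional Lorentzian line shape along H whose half width at half maximum is the inverse of the correlation length, so that the peak height scales as the integrated intensity times the correlation length. The line shape, the width convention, and the function names are assumptions made for illustration, not the fitting procedure used in the study.

```python
import math


def lorentzian(q, q0, integrated, hwhm):
    """Area-normalized 1D Lorentzian: its integral over q equals `integrated`."""
    return (integrated / math.pi) * hwhm / ((q - q0) ** 2 + hwhm ** 2)


def peak_height(integrated, xi):
    """Peak height for correlation length xi, assuming HWHM = 1 / xi."""
    return lorentzian(0.0, 0.0, integrated, 1.0 / xi)


xi_before, xi_after = 36.0, 69.0  # correlation lengths in angstroms (from the text)
xi_ratio = xi_after / xi_before   # about 1.92, i.e. the ~90% increase

# Integrated intensity rises by 10 to 20% (from the text)
low = peak_height(1.10, xi_after) / peak_height(1.00, xi_before)
high = peak_height(1.20, xi_after) / peak_height(1.00, xi_before)

print(f"correlation length ratio: {xi_ratio:.2f}")
print(f"peak height ratio: {low:.2f} to {high:.2f}")
```

Under these assumptions the peak height would roughly double, in line with the reported enhancement of about 120%, and most of that gain comes from the narrowing rather than from added spectral weight.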

The nature of the nonthermal CDW state 
becomes evident when comparing the correlation 
length along the a axis (ξ_a) and the 
integrated intensity (I_int) obtained after 
photoexcitation with a conventional equilibrium 
synchrotron-based temperature-dependence 
study (Fig. 2, C and D). The values of I_int and 
ξ_a at 65 and 12 K that are obtained at the 
free-electron laser before photoexcitation 
(blue triangles) are in line with the equilib- 
rium characterization (black open circles). 
Fig. 3. Scenarios for superconductivity and CDW interactions. (A to C) Models of the spatial arrangements 
of CDW and superconductivity (SC) regions before and after photoexcitations. The first column represents 
the ground state in each case, and the second column represents the state after the quench of the 
superconducting state by a laser pulse. The third column shows the simulated scattering profile for each of 
these cases for the equilibrium (gray solid lines) and photoexcited (purple dashed lines) cases (36). 
The profiles are compared to the experimental data from Fig. 2B (purple solid lines and symbols). The 
three scenarios considered are coexisting homogeneous superconductivity and CDW orders (A), mesoscale 
(>1 nm) phase separation between superconductivity and CDW domains (B), and a superconductivity-induced 
topological defect within the CDW domain (36) (C). The data are well reproduced in the case 
shown in (C). Note that in (C), the superconducting state is inhomogeneous and developing more strongly 
around the defects but still permeates the whole sample volume. 


However, the dynamical CDW data (red tri- 
angles) clearly do not follow the equilibrium 
temperature dependence; hence, they cannot 
be described by an effective temperature scenario. 
The increase in ξ_a is well beyond what 
can be achieved at equilibrium, whereas the 
increase in I_int remains small compared with 
the temperature-dependent swing. These 
data clearly corroborate that the nonthermal 
state created by the 800-nm laser pulse has 
no equilibrium counterpart. 

The ability to measure both CDWs and 
superconductivity on the same sample under sim- 
ilar excitation conditions provides unparalleled 
insight into the dynamics of intertwined orders. 
Next, we discuss three possible scenarios 
for the superconductivity-CDW interaction. 
First, we consider the case of homogeneous 
and coexisting orders, which we interpret 
within the framework of time-dependent 
Ginzburg-Landau (TDGL) theory, as discussed 
in other systems (22, 39). TDGL theory predicts 
that for homogeneous and competitive orders, 
the CDW order parameter amplitude will 
increase on a picosecond time scale, driven 
by the quench of superconductivity (36). This 
scenario is depicted in Fig. 3A, and the 
corresponding CDW scattering peak shows 
a notable increase in the integrated inten- 
sity [compare with (24, 36)]. This is in stark 
disagreement with our data, where the sig- 
nal is dominated by a change in correlation 
length instead. We thus conclude that a sim- 
ple competition model, assuming homoge- 
neous and coexisting orders, is incompatible 
with our results. 

A second scenario is that of phase-separated, 
strongly competitive orders. In this case we 
consider well-separated CDW and supercon- 
ducting domains [see, e.g., (17)], with an av- 
erage spacing between neighboring CDW 
domains that is larger than the CDW perio- 
dicity (>1 nm). In this scenario, depicted in 
Fig. 3B, ξ_a coincides with the size of the CDW 
domains, which expand in response to the 
suppression of neighboring superconducting 


domains. This inflation leads to an increase 
in I_int, thanks to the expanding CDW filling 
factor, where I_int ∝ ξ_a². Such scaling is incompatible 
with our data, ruling out domain 
expansion as the root cause of the notable 
correlation length increase that we observe 
within ~2 ps after photoexcitation. Even in the 
case of a unidirectional expansion of domains, 
where the filling factor scales linearly with ξ_a, 
hence, I_int ∝ ξ_a, this scenario remains incompatible 
with the data [see Fig. 3B and (36)]. 
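
The size of this mismatch can be made explicit with a short calculation. The sketch below takes the correlation lengths and the 10 to 20% intensity change quoted for 12 K and evaluates what the two domain-expansion scalings would predict; the variable names are illustrative, and the comparison ignores experimental uncertainties.

```python
xi_before, xi_after = 36.0, 69.0  # correlation lengths in angstroms (from the text)
ratio = xi_after / xi_before

# Predicted fractional increase of the integrated intensity I_int
predicted = {
    "isotropic expansion (I_int ∝ ξ_a²)": ratio ** 2 - 1.0,
    "unidirectional expansion (I_int ∝ ξ_a)": ratio - 1.0,
}

# Observed increase of I_int at 12 K (from the text): between 10 and 20%
observed_low, observed_high = 0.10, 0.20

for scenario, increase in predicted.items():
    compatible = observed_low <= increase <= observed_high
    print(f"{scenario}: +{100 * increase:.0f}% predicted, compatible: {compatible}")
```

Both scalings predict an intensity gain several times larger than the observed 10 to 20%, which is the quantitative content of the statement that domain expansion cannot account for the data.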

To explain our observations, we describe 
a third hypothesis, illustrated in Fig. 3C. 
Before photoexcitation, the spatial coherence 
of the CDW domain is disrupted by a super- 
conductivity-induced defect, stabilized by an 
interstitial superconducting region within the 
domain. Similar situations have been discussed 
theoretically (40-42) in the context of CDW 
dislocations primarily caused by quenched 
disorder, with superconductivity forming op- 
portunistically around them. However, in our 
experiment, the light quench of the supercon- 
ducting condensate directly causes a reaction 
of the CDW within ~1 ps. Extrinsic CDW de- 
fects, that is, those pinned to disorder, typically 
relax on much longer time scales of hundreds 
of picoseconds (43) or longer (44), with light- 
driven dynamics of oxygen ordering taking 
place on time scales on the order of minutes 
(45). Furthermore, the depinning process 
is thermally activated (42, 44, 46), which is 
clearly incompatible with our observation 
of faster defect removal dynamics at 20 K 
(1 ps) as compared to 65 K (>3 ps). We thus 
conclude that our data are consistent with the 
hypothesis that superconductivity plays the 
lead role in shaping the inhomogeneous 
CDW landscape by creating intrinsic defects, 
which rapidly disappear as superconduc- 
tivity is quenched. To further corroborate 
this point, we note that vacancies caused by 
quenched disorder primarily affect I_int, as
reported in (47) and confirmed by our simu- 
lations (36). 

For comparison, we note that high magnetic 
field and uniaxial pressure experiments can 
explore the effects of quasi-static suppression 
of superconductivity. These works report an 
increase in the integrated intensity in addi- 
tion to the increase in correlation length 
(15, 19, 20). We interpret this difference in 
terms of the slower time scale of those external 
stimuli (>10~° s), which in turn affect both 
extrinsic and intrinsic defects. Instead, our 
experiment, by using ultrafast light excita- 
tion, can isolate the dynamics of the rapidly 
reacting intrinsic defects, which are primarily 
caused by superconductivity and not pinned 
to disorder. 

In terms of the microscopic character of 
these superconductivity-induced defects, we 
reiterate that nontopological point-like defects 
in the CDW pattern, such as vacancies, can be 
excluded because their photoinduced removal
should primarily affect I_int (36), in disagree-
ment with the data. We thus consider the case
of topological defects, such as CDW disloca- 
tions and discommensuration lines (sketched 
in Fig. 3C), whose presence is well known in 
cuprates (12, 16, 18) and which can affect 
CDW coherence beyond small local perturba- 
tions. We find that the removal of discom- 
mensuration lines after the photoquench of 
superconductivity captures the salient aspects 
of the data (Fig. 3C), where ξ nearly doubles
without an increase in I_int (36). The sudden
photoquench of superconductivity leads to 
the annihilation of this topological defect, 
thus reestablishing phase coherence. Within 
this model, the ~0.003-r.l.u. momentum shift
reported in Fig. 2B can be explained in terms 
of a ~1% increase in periodicity after defect 
annihilation and subsequent expansion of the 
CDW pattern within the domain. 

The presence of periodic discommensura- 
tion lines could reconcile the experimental 
observation of an incommensurate CDW wave 
vector (6-11, 13-16) with reports of local com- 
mensurate order, as detected in Bi2212 (4, 5, 12) 
and YBCO (17, 18). Importantly, our data are 
consistent with an active role of supercon- 
ductivity in the formation and stabilization of 
discommensurations, which is suggestive of a 
strong coupling between such defects and 
superconductivity, as recently discussed with 
respect to transition-metal dichalcogenides 
(48). Within this scenario, we expect super- 
conductivity to be enhanced along the dis- 
commensuration lines, as depicted in Fig. 3C, 
forming a corresponding spatially modulated 
state, in line with the observation of pair den- 
sity waves in Bi-based superconductors (49, 50). 
Although we cannot determine the periodicity 
of these defects at the moment, we estimate an 
average spacing of at least ~3 to 4 nm between
discommensurations (36). 

Importantly, in our experiment, supercon- 
ductivity appears to disrupt the spatial co- 
herence within a CDW domain. This differs 
markedly from a phase-separated state for 
strongly competitive orders and corroborates 
the intertwining between superconductivity 
and CDWs. In this context, we also note that 
an ultrafast sliding of CDWs is required to 
restore a common phase pattern across the 
CDW domain within ~1 ps. The time scale of 
this CDW motion is compatible with the rela- 
xation dynamics of periodic lattice distortions 
(34), with an estimated speed of ~3 × 10⁵ cm/s,
which is comparable to the speed of sound 
in YBCO (51).
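As a rough plausibility check on the order of magnitude of this speed (not the authors' own derivation), one can divide the estimated discommensuration spacing by the ~1 ps recovery time. The spacing and time are taken from the text; the idea that the CDW has to slide across roughly one defect spacing is an assumption made here for illustration, and the function name is hypothetical.

```python
NM_TO_CM = 1e-7   # 1 nm = 1e-7 cm
PS_TO_S = 1e-12   # 1 ps = 1e-12 s

def sliding_speed_cm_per_s(distance_nm, time_ps):
    """Average speed needed to cover distance_nm within time_ps, in cm/s."""
    return (distance_nm * NM_TO_CM) / (time_ps * PS_TO_S)

# Assumed sliding distance: about one discommensuration spacing (3 to 4 nm)
for spacing_nm in (3.0, 4.0):
    speed = sliding_speed_cm_per_s(spacing_nm, 1.0)
    print(f"{spacing_nm} nm in 1 ps -> {speed:.1e} cm/s")
```

Both values fall in the 10⁵ cm/s range, that is, a few kilometers per second, which is the scale of sound velocities in solids.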

By using an ultrafast probe of the CDW 
order parameter after the optical quench of 
superconductivity, we observe the dynamical 
interaction between superconductivity and 
CDWs. The results suggest that superconduc- 
tivity disrupts the CDW spatial coherence 
within CDW domains by stabilizing topo-
logical defects, such as discommensurations.
Although further studies are needed to un- 
equivocally identify the microscopic nature 
of these defects, this approach establishes 
opportunities to study competing orders that 
are complementary to current steady-state 
methods, such as high fields and uniaxial 
pressure experiments. By observing the early 
state after the quench of one of the order 
parameters, our strategy allows the sensitive 
detection of spatial patterns that are intrin- 
sically related to the interaction between 
intertwined orders. 

Multispecies forest plantations outyield
monocultures across a broad range of conditions 


Yuhao Feng, Bernhard Schmid, Michel Loreau, David I. Forrester, Songlin Fei, Jianxiao Zhu,
Zhiyao Tang, Jiangling Zhu, Pubin Hong, Chengjun Ji, Yue Shi, Haojie Su, Xinyu Xiong,
Jian Xiao, Shaopeng Wang, Jingyun Fang*


Multispecies tree planting has long been applied in forestry and landscape restoration in the hope of
providing better timber production and ecosystem services; however, a systematic assessment of its
effectiveness is lacking. We compiled a global dataset of matched single-species and multispecies
plantations to evaluate the impact of multispecies planting on stand growth. Average tree height,
diameter at breast height, and aboveground biomass were 5.4, 6.8, and 25.5% higher, respectively,
in multispecies stands compared with single-species stands. These positive effects were mainly the
result of interspecific complementarity and were modulated by differences in leaf morphology and leaf
life span, stand age, planting density, and temperature. Our results have implications for designing
afforestation and reforestation strategies and bridging experimental studies of biodiversity-ecosystem
functioning relationships with real-world practices.


Forest plantations are an important means
of restoring degraded land, supplying eco- 
system services, and mitigating climate 
change (1-4). In recent years, several 
global initiatives, such as the Bonn Chal- 
lenge (5), the New York Declaration on Forests 
(6), and the United Nations (UN) Decade on 
Ecosystem Restoration (7), have been launched 
to restore degraded ecosystems and provide 
nature-based solutions to mitigate climate 
change by increasing global forest cover in 
the coming decades. In the context of these 
global initiatives, there is an urgent need to 
understand the factors that promote the func- 
tions and services of forest plantations to 
ensure effective strategic planning of affor- 
estation and reforestation practices. 

In the hope of maximizing the benefits of 
plantations, early foresters explored various 
forest plantation strategies (8, 9). One strategy 
was to plant several tree species with different 
traits, such as legume and nonlegume species 
or conifer and broad-leaved trees—i.e., multi- 
species plantations (9, 10). Ecological theory 
and experiments predict that multiple species 
growing together will create more biomass, a
pattern called the biodiversity-ecosystem function-
ing (BEF) relationship (11, 12). Biodiversity
can increase biomass production because of 
either niche differentiation among species 
(complementarity effects) or an increased like- 
lihood that highly productive species are pre- 
sent (selection effects) (12). Field experiments 
in both grasslands (12, 13) and forests (14-16) 
have revealed that complementarity effects
generally contribute more than selection effects 
to the observed positive BEF relationships and 
that these effects change with traits of spe- 
cies, time, planting density, and environmental 
conditions. The positive effect of biodiversity 
on ecosystem productivity has also been doc- 
umented by global and regional observational 
studies (17, 18). The long history of multispe- 
cies plantation practices (10, 19, 20) and the 
growing experimental research on tree diver- 
sity effects [e.g., the global network of tree 
diversity experiments, TreeDivNet (16, 21)] 
provide an opportunity for a systematic assess- 
ment of the effects of species mixing in forest 
plantations and the mechanisms that underpin 
them. Here, we hypothesize that, as in BEF 
experiments with random species selection, 
multispecies plantations established by fores- 
ters can also increase tree growth and biomass 
production through complementarity effects 
and that these effects are modulated by biotic 
and abiotic factors. 

To test our hypotheses, we assembled a 
global dataset (Global MixTrees) of single- 
species (monocultures) and multispecies stands 
(mixtures or mixed stands) matched by age 
and planting density at 255 sites from 273 pub- 
lications (22) (figs. S1 and S2). The dataset 
contains 243 tree species with diverse func- 
tional traits, wide ranges of stand age and 

Fig. 1. Effect sizes of species mixing on height, DBH, and aboveground biomass. (A) Effect sizes at
the species level. (B) Effect sizes at the whole-stand level. In (B), net effect (NE), complementarity effect
(CE), and selection effect (SE) are shown in blue, green, and yellow colors, respectively. Gray bubbles show
the original sample effect size, and the two numbers above each subset graph indicate the number of
samples with effect sizes greater than (blue) or smaller than (red) zero. The colored bars represent
the 95% confidence intervals of mean effect size across samples. *P < 0.05.

Fig. 2. Impacts of functional trait composition on the effect sizes of species mixing. (A to C) Impacts
at the species level. (D to F) Impacts at the whole-stand level. (A and D) Tree height. (B and E) DBH.
(C and F) Aboveground biomass. Functional traits are divided into three categories: leaf morphology
[broad-leaved (BL) versus needle-leaved (NL)], leaf life span [deciduous (DE) versus evergreen (EV)], and
nitrogen acquisition strategy [nitrogen-fixing (NF) versus non-nitrogen-fixing (non-NF)]. In (A) to (C),
“BL (BL × NL)” denotes the effect size for the BL species in the mixed stands consisting of BL and NL
species; in (D) to (F), “BL / NL” denotes the mixed stands consisting of only BL or only NL species, and
“BL × NL” denotes the mixed stands consisting of both BL and NL species. Other notations are similarly
defined. Colored dots and shaded bars represent the mean effect sizes and their 95% confidence intervals,
respectively. P values are derived from likelihood ratio tests (22). In (A) to (C), different letters denote
significant differences in the mean effect size between groups. In (D) to (F), NE, CE, and SE are shown in
blue, green, and yellow colors, respectively.


planting density, and spanning a broad cli- 
matic gradient. It includes information on 
three major stand-level growth variables: 
mean tree height (in meters), mean diameter 
at breast height (DBH) (in centimeters), and 
aboveground biomass (in metric tons per hect-
are) (data S1 and S2). We defined a sample as a
set of matched monocultures and mixed stands 
from the same site and with the same stand
age, planting density, and mixing ratio (i.e., 
percentage of each species at planting) (figs. 
S3 and S4). Each sample typically had three 
to six replicates, which we used to charac- 
terize spatial environmental heterogeneity 
and estimate the variation of species mixing 
effects within the site. In total, our dataset con-
sisted of 5959 samples, including 2323 samples
for mean height, 2362 samples for mean DBH, 
and 1274 samples for aboveground biomass. 
Most samples had mixed stands consisting of 
two species (5439 samples), and the remaining 
had 3 to 16 species (520 samples) (fig. S5). We 
conducted our analyses at both the species 
(i.e., the single target species) and community 
(i.e., the whole stand) levels. For species-level 
analyses, 4406 samples were used to compare 
the growth of a target species in mixed stands 
with its growth in monocultures (fig. S3); for
whole-stand-level analyses, the remaining 
1553 samples were used to compare the growth 
of the whole mixed stands with that of the cor- 
responding monocultures (fig. S4). 

Using this dataset, we examined the effects 
of species mixing on three growth variables 
(tree height, DBH, and aboveground biomass) 
and explored how biotic (i.e., functional traits, 
stand age, and planting density) and abiotic 
(i.e., temperature and precipitation) factors 
influenced the effects of species mixing. For 
each growth variable, we quantified the effect 
size of species mixing at the species level using 
the logarithmic ratio of the growth variable 
in mixed stands versus monocultures of the 
target species [eqs. S7 and S8 (22, 23)] and 
at the whole-stand level using the relative 
difference between the growth variable 
in mixed stands and the average of this 
variable in the monocultures of the compo- 
nent species [eqs. S9 and S10 (22)]. At the 
whole-stand level, we further partitioned the 
effect size of species mixing (also called net 
effect) additively into complementarity and 
selection effects to determine whether the
mixing effect was caused by niche differenti- 
ation or the presence of specific species (13). 
Then, we pooled all effect sizes across sam- 
ples to derive an overall effect size of species 
mixing at both species and whole-stand levels, 
as well as for complementarity and selection 
effects (23). For all analyses, we conducted 
tests of robustness using samples with two- 
species mixed stands and found that all results 
were qualitatively similar (figs. S6 to S9). We 
also tested whether effect sizes increased with 
species richness beyond two species. 
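The additive partitioning step can be sketched as follows. This is a minimal illustration of the standard complementarity/selection decomposition, written in absolute units and with hypothetical function and argument names; the authors' exact effect-size definitions are in their supplementary equations and may differ in detail, for example by expressing the net effect as a relative difference. The input numbers are invented for illustration and are not taken from the dataset.

```python
from statistics import mean

def partition_net_effect(mono, mix, planted):
    """Split the net mixing effect into complementarity and selection terms.

    mono    -- monoculture value of each component species
    mix     -- each species' contribution to the mixed stand
    planted -- planted proportion of each species (should sum to 1)
    """
    n = len(mono)
    # Mixture value expected if every species performed as in monoculture
    expected = sum(p * m for p, m in zip(planted, mono))
    net = sum(mix) - expected
    # Deviation of observed from expected relative yield, per species
    d_ry = [y / m - p for y, m, p in zip(mix, mono, planted)]
    mean_dry, mean_mono = mean(d_ry), mean(mono)
    complementarity = n * mean_dry * mean_mono
    # Population covariance between yield deviation and monoculture value
    cov = sum((d - mean_dry) * (m - mean_mono) for d, m in zip(d_ry, mono)) / n
    selection = n * cov
    return net, complementarity, selection

# Hypothetical two-species stand planted 50:50 (illustrative numbers only)
net, ce, se = partition_net_effect(mono=[10.0, 20.0], mix=[4.0, 14.0], planted=[0.5, 0.5])
print(f"net={net:.2f}  complementarity={ce:.2f}  selection={se:.2f}")
```

By construction the two terms sum to the net effect, and the selection term vanishes when all component species have the same monoculture value, because the covariance is then zero.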

Our analyses showed that, at the species 
level, the mean values of the three growth var- 
iables were all larger in multispecies stands 
than in monocultures (Fig. 1A). The mean effect 
sizes for height, DBH, and biomass were 0.060 
(95% confidence interval, 0.047 to 0.073), 0.065 
(0.050 to 0.079), and 0.196 (0.160 to 0.232), 
respectively, corresponding to a relative in- 
crease of 6.2% (4.8 to 7.6%), 6.7% (5.1 to 8.2%), 
and 21.7% (17.3 to 26.1%), respectively. Positive 
effect sizes for these variables occurred in 
70.1, 70.0, and 76.7% of all samples, respec- 
tively (Fig. 1A and fig. S10, A to C). The effect 
sizes of these growth variables tended to in- 
crease with species richness (fig. S11, A to C). 
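The correspondence between the reported effect sizes and the percentage increases follows from the log-ratio definition: a log response ratio r maps to a relative change of exp(r) − 1. The short sketch below reproduces that conversion for the three mean effect sizes quoted above; the function names are hypothetical.

```python
import math

def log_response_ratio(mixed, mono):
    """Species-level effect size: natural log of mixed-stand over monoculture value."""
    return math.log(mixed / mono)

def percent_change(log_ratio):
    """Convert a log response ratio into a relative change in percent."""
    return (math.exp(log_ratio) - 1.0) * 100.0

# Mean effect sizes quoted in the text
for name, effect_size in (("height", 0.060), ("DBH", 0.065), ("biomass", 0.196)):
    print(f"{name}: {percent_change(effect_size):.1f}%")
```

The same conversion applied to the confidence limits gives the percentage intervals, up to rounding of the published effect sizes.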

Fig. 3. Changes in the mixing effects with stand
age. (A to C) Changes at the species level. (D to
F) Changes at the whole-stand level. (A and D) Tree
height. (B and E) DBH. (C and F) Aboveground
biomass. P values are derived from
likelihood ratio tests (22). Shaded areas are 95%
confidence intervals of fitted curves, with dashed lines
indicating P > 0.05. Bubble sizes are proportional to the
weights of samples. In (D) to (F), NE, CE, and SE are
shown in blue, green, and yellow colors, respectively.

Fig. 4. Changes in the mixing effects with tree density. (A to C) Changes at the species level. (D to
F) Changes at the whole-stand level. (A and D) Tree height. (B and E) DBH. (C and F) Aboveground
biomass. The upper limit of planting density is set to 10,000 stems ha⁻¹, and the density values are
log10-transformed before statistical analyses to avoid bias as a result of high leverage of a few sites with
very high densities. In (D) to (F), NE, CE, and SE are shown in blue, green, and yellow colors, respectively.
For other explanations, see the caption of Fig. 3.


At the whole-stand level, the three growth 
variables were also higher in multispecies 
stands than in monocultures (Fig. 1B), with 
an increase of 5.4% (3.9 to 6.8%) in height, 
6.8% (5.3 to 8.2%) in DBH, and 25.5% (21.2 to 
29.8%) in biomass. Positive net effects of these 
variables occurred in 71.4, 74.5, and 85.0% of 
all samples, respectively (Fig. 1B and fig. 
S10, D to F). These positive net effects were 
mainly the result of complementarity rather 
than selection effects (which were close to zero 
or even negative) (Fig. 1B and fig. S10, G to L). 
These small selection effects can be understood 
by the fact that foresters deliberately use highly
productive tree species (24) and thus exclude
low-performing species that probably represent
most of the species in BEF experiments with
random species selection (14) (if component
species have equal performance in mono- 
culture, then selection effects are zero by 
definition). As at the species level, the net 
and complementarity effects of the growth 
variables increased with species richness 
beyond two species (fig. S11, D to F), which 
supports the conclusions of previous experi- 
mental and observational studies (14, 17, 18). 


To disentangle the mechanisms underpin- 
ning the positive effects of species mixing, we 
investigated how combinations of tree spe- 
cies with different functional traits influenced 
tree size or biomass in multispecies plantations. 
We focused our analysis on three groupings 
of species according to functional traits: leaf 
morphology (i.e., broad-leaved versus needle- 
leaved species), leaf life span (i.e., deciduous 
versus evergreen species), and nitrogen ac- 
quisition strategy (i.e., nitrogen-fixing versus 
non-nitrogen-fixing species). For each of these 
three groupings, multispecies plantations with 
opposite functional traits may produce higher 
complementarity effects through niche parti- 
tioning: Broad-leaved and needle-leaved spe- 
cies can partition space in the canopy (25), 
deciduous and evergreen species can partition 
seasonal time (26), and non-nitrogen-fixing 
species and nitrogen-fixing species can parti- 
tion the source of nitrogen nutrition (27). 

At the species level, target species tended 
to benefit more in one or more growth varia- 
bles when growing with species in the opposite 
functional trait categories compared with those 
growing with species with similar traits (fig. 
S12). Both broad- and needle-leaved species
grew taller and produced more biomass when 
growing with species of the other functional 
group than when growing with species of 
the same functional group (Fig. 2, A and C). 
Deciduous and evergreen species also grew 
taller when growing with species of the other 
compared with species of the same functional 
group (Fig. 2A). However, nitrogen-fixing and 
non-nitrogen-fixing species performed sim- 
ilarly when growing with species of the other 
or with species of the same functional group 
(Fig. 2, A to C). Analyses at the whole-stand 
level corroborated these species-level results. 
Multispecies stands containing both broad- 
and needle-leaved species had larger net and 
complementarity effects on all three growth
variables than multispecies stands containing 
only broad-leaved or only needle-leaved species 
(Fig. 2, D to F, and fig. S13). Multispecies stands 
containing both deciduous and evergreen spe- 
cies had larger net and complementarity effects 
on height and DBH than multispecies stands 
containing only deciduous or only evergreen 
species (Fig. 2, D and E). However, multi- 
species stands containing both nitrogen-fixing 
and non-nitrogen-fixing species performed 
similarly to multispecies stands containing 
only nitrogen-fixing or only non-nitrogen- 
fixing species (Fig. 2, D to F). Overall, our 
results suggest that functional differences in 
leaf morphology and leaf life span—but notably 
not in nitrogen acquisition strategy—underpin 
stronger complementarity effects in mixed 
stands (15, 28). 

Next, we examined whether stand charac- 
teristics (stand age and planting density) mod- 
ified the effects of species mixing. At both the 
species and whole-stand levels, the effect sizes 
of species mixing on height, DBH, and bio- 
mass all showed unimodal relationships when 
plotted against stand age, which peaked at 
~25 years (Fig. 3). These unimodal patterns 
were mainly driven by time-dependent com- 
plementarity effects, which first increased and 
then decreased with stand age (Fig. 3, D to F). 
Previous BEF experiments have revealed in- 
creasing complementarity and biodiversity 
effects with experimental duration but were 
limited to a relatively short time span [e.g., 
<15 years (14, 29)]. Our results corroborate 
findings from these short-term experiments, 
but they also suggest that as stands develop 
further, complementarity effects may subse- 
quently decrease, possibly as a result of en- 
hanced interspecific competition for light or 
soil resources (30). 

Along a gradient of tree planting density, 
the species-level effect sizes for height and 
DBH showed increasing trends (Fig. 4, A and B), 
but that for biomass showed a unimodal pat-
tern, peaking at a density of ~2860 stems ha⁻¹
(Fig. 4C). At the whole-stand level, the effect 
sizes for all three growth variables showed 
similar unimodal patterns along a gradient 
of tree density, peaking at ~2500, 2700, and 
4100 stems ha⁻¹, respectively (Fig. 4, D to F).
These results support previous findings that 
the effect of species mixing increases with 
planting density within low-density ranges 
[e.g., <2000 stems ha⁻¹ (31)] but that negative
effects can emerge at high densities (32). In 
other words, the benefits of multispecies plan- 
tations may be reduced at both very low den- 
sities, where individuals are far apart and thus 
do not interact, and very high densities, where 
intense competition may inhibit the operation 
of complementarity (30). 
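One simple way to locate the peak of such a unimodal relationship is to take the vertex of a quadratic in log10-transformed density. The sketch below assumes that functional form purely for illustration; the text does not state which curve was fitted, and the coefficients and function names are hypothetical.

```python
import math

def log_density(stems_per_ha):
    """log10 transform applied to planting density before analysis."""
    return math.log10(stems_per_ha)

def peak_density(b1, b2):
    """Density at the vertex of effect = b0 + b1*x + b2*x**2, with x = log10(density)."""
    if b2 >= 0:
        raise ValueError("a peaked curve needs a negative quadratic coefficient")
    return 10 ** (-b1 / (2 * b2))

# Hypothetical coefficients, chosen only to show the calculation
peak = peak_density(b1=3.4, b2=-0.5)
print(f"peak at about {peak:.0f} stems per hectare (log10 = {log_density(peak):.2f})")
```

Because the fit is done on the log scale, the peak is symmetric in log10 density rather than in density itself.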

Climate (temperature and precipitation) may 
be a factor that influences the effects of species 
mixing (20). At both the species and whole-
stand levels, the effect sizes for the three growth
variables all showed increasing trends along a 
gradient of mean annual temperature [fig. S14; 
see (22) for climatic data]. This increase was 
largely the result of interspecific complemen- 
tarity, whereas selection effects were negligi- 
ble at the whole-stand level. This suggests that 
warmer climates could benefit tree growth 
in multispecies stands more than in mono- 
cultures, which may be because of longer grow- 
ing seasons that promote complementarity in 
time (33). Compared with temperature, mean 
annual precipitation did not significantly in- 
fluence the mixing effect sizes (fig. S15), prob- 
ably because forest plantations mainly occurred 
in humid areas where precipitation may not 
be a limiting factor (34). 

Our study demonstrates substantial benefits 
of multispecies plantations, which corrobo- 
rates and generalizes findings from forest BEF 
experiments (14, 16, 21) and helps to show 
their implications for real-world forestry ap- 
plications. Our analyses offer predictions that 
go beyond the conclusions of short-term BEF 
experiments with random species selection 
and call for more long-term experiments with 
deliberate species selection to reveal the time 
dependency of biodiversity effects. Our find- 
ings also have direct implications for forest 
management and afforestation and refores- 
tation practices. Foresters often prefer mono- 
culture plantations because of their lower 
planting and management costs, targeted 
special uses of trees (e.g., kind and amount of 
timber production and economic value) (35), 
and the difficulty in identifying the site char- 
acteristics that will give the best result for a 
given multispecies plantation (30, 35). How- 
ever, a growing body of research is revealing 
the benefits of multispecies plantations in 
terms of productivity (this study), stability 
(16, 36), community structure (15), and bio- 
diversity (4). Therefore, we advocate a multi- 
functional cost-benefit analysis framework to 
identify planting strategies that optimize the 
benefits of forest plantations for restoration, 
conservation, and climate change mitigation. 

Directed evolution of nonheme iron enzymes to
access abiological radical-relay C(sp³)-H azidation


Jinyan Rui, Qun Zhao, Anthony J. Huls, Jordi Soler, Jared C. Paris, Zhenhong Chen,
Viktor Reshetnikov, Yunfang Yang, Yisong Guo*, Marc Garcia-Borras*, Xiongyi Huang*


We report the reprogramming of nonheme iron enzymes to catalyze an abiological C(sp³)-H azidation
reaction through iron-catalyzed radical relay. This biocatalytic transformation uses amidyl radicals
as hydrogen atom abstractors and Fe(III)-N₃ intermediates as radical trapping agents. We established a
high-throughput screening platform based on click chemistry for rapid evolution of the catalytic
performance of identified enzymes. The final optimized variants deliver a range of azidation products
with up to 10,600 total turnovers and 93% enantiomeric excess. Given the prevalence of radical
relay reactions in organic synthesis and the diversity of nonheme iron enzymes, we envision that this
discovery will stimulate future development of metalloenzyme catalysts for synthetically useful
transformations unexplored by natural evolution.


Introducing abiological chemical transfor-
mations to natural proteins represents a
powerful approach to advance enzymatic
catalysis to reaction territories unexplored
by natural evolution (1). This strategy
enables enzyme reprogramming to achieve
challenging synthetic reactions regio- and
enantioselectively while maintaining genetic
tunability (2, 3). Representative examples include
taming heme and nonheme metalloenzymes
to mediate carbene- and nitrene-transfer re-
actions (4), reprogramming flavoenzymes for
photoredox catalysis (5, 6), and reconfiguring
carbonic anhydrase to perform metal-hydride
chemistry (7). Despite this progress, most re-
actions in organic synthesis have no known
biological counterparts, and the mechanisms
that empower these transformations were not
developed by nature during the course of nat-
ural evolution (7). To unleash the full potential
of enzymes for modern chemical synthesis, it
is pivotal to introduce fundamental reaction
modes in synthetic chemistry to the catalytic
repertoire of biology.

We consider a mechanism-driven approach to 
expand the scope of biocatalysis. We envisioned 
that natural metalloenzymes that share mecha- 
nistic elements with synthetic metal-catalyzed 
reactions would exhibit promiscuous activity 
toward these reactions, from which new cat- 
alytic functions can be evolved. Guided by this 
design principle, we envisaged that nonheme 
iron enzymes could be reprogrammed to per- 
form radical-relay C—H functionalization—an 
important class of unnatural reactions widely 
used in organic synthesis (8-11). The defining
feature of a metal-catalyzed radical relay is
the use of a reactive radical (X•) to activate
a C(sp³)-H bond through hydrogen atom
transfer (HAT) and the interception of the
resulting carbon-centered radical by a redox-
active metal complex (Fig. 1A). This process
mechanistically resembles the C(sp³)-H ha-
logenation reactions catalyzed by nonheme
iron halogenases, in which an iron(IV)-oxo
complex is used to activate substrates by means
of HAT and an iron(III)-halide/azide inter-
mediate intercepts substrate radicals to form
carbon-halogen/azide bonds (Fig. 1B) (12-16).
Enlightened by this mechanistic similarity, we 
proposed that a nonheme iron enzyme could 
mediate a radical relay process through an 
initial substrate activation at a Fe(II) center 
to generate a reactive amidyl radical for HAT 
and subsequent transfer of a Fe(III)-bound 
ligand to the carbon-centered radical (Fig. 
1C). This new biocatalytic reaction proceeds 
through a redox neutral pathway involving 
an Fe(II)/Fe(III) redox couple, which offers a com-
plementary approach to the native Fe(IV) cat-
alytic cycle of nonheme iron enzymes for C-H
functionalization. 

As proof-of-concept, we employed this en- 
zymatic radical-relay strategy to develop new 
biocatalysts to perform a nonnatural C(sp³)-H
azidation reaction. Current synthetic approaches 
for this reaction are limited in turnovers, not 
enantioselective, and require an acidic azide 
source (scheme S4) (17). We presumed that 
these challenges could be met by leveraging
the genetic tunability and high catalytic effi-
ciency of nonheme iron enzymes. We began our
investigation by testing a panel of nine func- 
tionally diverse nonheme iron enzymes with 
an N-fluoroamide substrate INF under whole- 
cell conditions. A (4-hydroxyphenyl)pyruvate 
dioxygenase from Streptomyces avermitilis 
(Sav HppD) provided the desired azidation 
product with 250 total turnovers (TTN), an 
enantiomeric ratio (e.r.) of 63:37, and a chemo- 
selectivity of 9:1 for azidation over fluorination 
product (tables S1 and S2). Only a trace amount 
of the azidation product was obtained in a 
reaction lacking Sav HppD (entry 10, table S1).
Moreover, mutating the two iron-coordinating 
histidines to alanines abolished the enzyme 
activity while retaining the fold of wild-type 
(WT) Sav HppD (entry 11, table S1, and fig. S1),
supporting the proposal that this enzymatic 
azidation occurs at the 2-His-1-carboxylate iron 
center. The unazidated amide product was also 
detected in trace amounts but was likely formed 
through an unidentified nonenzymatic pro- 
cess as the double alanine mutant afforded 
this product in a yield comparable to that of 
the WT enzyme (tables S1 and S2).
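The selectivity figures in this article are quoted both as enantiomeric ratios (e.r.) and as enantiomeric excess (ee). The sketch below shows the standard conversion between the two, using the wild-type e.r. of 63:37 and the 93% ee quoted in the abstract as inputs; the function names are hypothetical.

```python
def er_to_ee(major, minor):
    """Enantiomeric excess (%) from an enantiomeric ratio major:minor."""
    return 100.0 * (major - minor) / (major + minor)

def ee_to_er(ee_percent):
    """Enantiomeric ratio (major, minor), normalized to 100, from ee in percent."""
    return (100.0 + ee_percent) / 2.0, (100.0 - ee_percent) / 2.0

# Wild-type Sav HppD, e.r. 63:37
print(f"e.r. 63:37 -> ee = {er_to_ee(63, 37):g}%")

# Best variants, 93% ee as quoted in the abstract
major, minor = ee_to_er(93)
print(f"93% ee -> e.r. = {major:g}:{minor:g}")
```

Expressed this way, the wild-type enzyme starts from a modest excess of one enantiomer, which makes the gap to the optimized variants easier to compare on a single scale.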

We set out to improve the performance of 
Sav HppD through directed evolution. We 
performed computational modeling on the
WT enzyme with both azide and INF substrate
bound and chose fifteen residues for opti- 
mization (figs. S10, S11, S13, and S14). These 
residues primarily reside in three regions: 
the C-terminal α helix, the β barrel of the
C-terminal domain, as well as in loops sur- 
rounding the active site (Fig. 2A). We have 
also established a high-throughput screening 
(HTS) platform based on copper-catalyzed 
azide-alkyne cycloaddition (CuAAC) (18) and
achieved reliable quantification of enzymatic
azidation products with a coefficient of var-
iation of 9% and a detection limit of 4 μM
(scheme S1 and fig. S2). With this HTS plat-
form, we evaluated more than 5000 clones
generated through error-prone polymerase 
chain reaction (PCR) or site-saturation muta- 
genesis (Fig. 2, Band C). Asextuple mutant 
Sav HppD V189A F216A P243A N245Q Q255A 
L367I (denoted as Sav HppD Az1) furnished 
the product with 1340 TTN and 87:13 e.r. In 
this evolution campaign, we could not identify 
an enzyme variant with an e.r. higher than 
87:13. This result indicates that mutations 
that were beneficial for improving activity 
might not necessarily lead to an increase in 
enantioselectivity, which might be due to 
the differences in substrate positioning and 
geometric requirements for the rate-determining

Fig. 2. Directed evolution of nonheme iron enzymes for radical-relay C-H
azidation. (A) Protein residues selected for mutagenesis. Pink: loop residues
surrounding the active site (N191, F216, Q255, and F359); green: residues
on the C-terminal α helix (K361, L367, and N363); blue: residues on the β barrel
of the C-terminal domain (V189, S230, P243, N245, Q269, Q334, F336, and
R353) (PDB: 1T47). (B) A high-throughput screening platform for detection of
enzymatic azidation products. (C) Representative variants identified during
the directed evolution of Sav HppD. Experiments were performed at the
analytical scale by using suspensions of Escherichia coli (E. coli) cells
expressing Sav HppD variants (OD600 = 10), 10 mM substrate 1NF, 25 mM
NaN3, 2.5 mM Fe2+ in KPi buffer (pH 7.4) at room temperature under anaerobic
conditions for 24 hours (table S2).

Fig. 3. Substrate scope and synthetic applications of enzymatic C-H
azidation via iron-catalyzed radical relay. (A) Substrate scope of Sav HppD
Az1 and Sav HppD Az2. Experiments were performed at the analytical scale
by using suspensions of E. coli expressing Sav HppD variants in the KPi buffer
(pH 7.4) at room temperature under anaerobic conditions for 24 hours (for
detailed conditions see table S3). The absolute configuration of enzymatically
synthesized azidation product 1N was determined to be S through x-ray
crystallography. The absolute configurations of all other azidation products
were inferred by analogy. (B) Preparative scale synthesis and absolute
configuration determination. (C) One-pot chemoenzymatic synthesis by in situ
derivatization of enzymatic azidation products through CuAAC (for detailed
conditions see section IX of the supplementary materials).

Fig. 4. Spectroscopic and computational studies of the enzymatic radical-relay
C-H azidation. (A) (Left) Mössbauer spectrum of the Sav HppD Az1•Fe(II)
complex (top, black) and the spectroscopic changes upon azide addition
(bottom, black). The upward and downward absorption peaks represent
the spectral components that disappeared or appeared, respectively, after
the addition of azide. The colored solid lines represent spectral simulations
(see supplementary materials for detailed discussion). (Right) EPR spectrum
of Sav HppD Az1•Fe(II)•N3 complex after incubation with 18NF for 60 min
(black) and the spectral simulation (red). (B) (Left) Optical absorption spectra
of Sav HppD Az1•Fe(II)•N3 complex with 18NF (black) and after incubation
with 18NF for 60 min (red). The inset shows the reaction scheme. (Right) The
time-dependent change in the 505 nm feature. (C) Active site arrangement
of the Az2 variant with 1NF substrate bound in a near-attack conformation
for N-F activation characterized from MD simulations (see supplementary
materials for details; fig. S15). (D) Reaction mechanism obtained from
DFT calculations employing a truncated active-site model built from MD
simulations (see fig. S18 for details) (energies in kcal/mol, distances in Å, and
angles in degrees).

(wild type) for kcat, and 790 μM (Az1) versus
470 μM (wild type) for KM], whereas the more
enantioselective Az2 variant displayed a 9-fold
decrease in kcat (3.39 min⁻¹) and a 6.6-fold decrease
in KM (120 μM). Overall, both variants
showed ~2-fold improvement in catalytic efficiency
(kcat/KM) compared with that of the WT
enzyme (fig. S3).
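
As a worked illustration of how the catalytic efficiency follows from the quoted Michaelis-Menten parameters, the sketch below computes kcat/KM for Az2 from the values in the text. It assumes that the quoted 9-fold and 6.6-fold decreases are relative to Az1 (the 6.6-fold figure matches 790 μM / 120 μM); the function name is hypothetical.

```python
def catalytic_efficiency(kcat_per_min: float, km_micromolar: float) -> float:
    """Return kcat/KM in mM^-1 min^-1 from kcat (min^-1) and KM (uM)."""
    if km_micromolar <= 0:
        raise ValueError("KM must be positive")
    return kcat_per_min / (km_micromolar / 1000.0)


# Values quoted in the text for the Az2 variant.
az2_efficiency = catalytic_efficiency(3.39, 120.0)

# Assumption: both fold decreases are measured against Az1, so the
# efficiency ratio Az2/Az1 is (1/9) / (1/6.6).
az2_relative_to_az1 = (1.0 / 9.0) / (1.0 / 6.6)

print(f"Az2 kcat/KM = {az2_efficiency:.2f} mM^-1 min^-1")
print(f"Az2 efficiency relative to Az1 = {az2_relative_to_az1:.2f}")
```

Because kcat and KM fall together, the efficiency changes far less than either parameter alone.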

With these two final variants in hand, we 
optimized reaction conditions (table S3) and 
assayed a range of N-fluoroamide substrates 
to explore the scope and limitation of this 
reaction (Fig. 3A). Sav HppD Az1 generally
exhibited higher activity but lower enantiose- 
lectivity than Sav HppD Az2. The enzymatic 
reaction tolerated a range of aromatic substi- 
tution patterns with total turnovers up to 
10,060 and enantiomeric ratio up to 96.5:3.5 
(product 5N). Substrates with an extended 
alkyl chain at the benzylic position were well 
tolerated, providing products with moderate- 
to-good TTNs and enantioselectivity (products 
8N to 10N). The amide nitrogen substituent 
also affected enzyme performance as evidenced 
by a decrease in activity when a larger N-tert- 
amyl group is substituted for the N-tert-butyl 
group (1N and 6N, 15N and 17N). We also 
tried to extend the scope of N-radical precur- 
sors and replace azide with another halide or 
pseudohalide anion (scheme S2 and table S4). 
However, these efforts have not been success- 
ful as starting materials were recovered in 
most cases. As suggested by Mössbauer studies,
the inability of our method to incorporate other 
anionic ligands might be due to much weaker 
binding of these anions to the Fe(II) center of 
the enzymes. In a larger-scale reaction, Sav 
HppD Az1 furnished 1N in 65% isolated yield
at a 120-mg scale with undiminished enantioselectivity
(Fig. 3B). We also obtained single
crystals of 1N and assigned their absolute configuration
as S by x-ray crystallography. We
also produced primary organic azide 11N at a 
preparative scale and subsequently converted 
it into the estrone derivative 18 through a 
CuAAC reaction (Fig. 3C). This chemoenzymatic 
two-step synthesis yielded the triazole product 
19 in 55% isolated yield, demonstrating the 
potential of our platform to produce highly 
functionalized molecules when used in tandem 
with biocompatible reactions. 

We conducted several mechanistic studies 
to investigate the catalytic cycle proposed in 
Fig. 1C. Addition of N3⁻ to the Sav HppD
Az1•Fe(II) complex induced the formation of
two quadrupole doublets in the Mössbauer
spectrum with isomer shifts (δ) of 1.20 and
1.17 mm/s and quadrupole splittings (ΔEQ) of
2.29 and 2.97 mm/s, respectively. The observation
of two quadrupole doublets may reflect
different azide binding configurations to the
Fe(II) center (Fig. 4A and figs. S5, S8, and S10;
see section X of the supplementary materials
for further discussion). We then carried out
electron paramagnetic resonance (EPR) measurements
on a nitric oxide (NO)-bound Sav
HppD Az1•Fe(II) complex, which has a prominent
g ~ 4 EPR resonance that can be used to
monitor the interactions between the substrate
and the nonheme iron center (19). Adding azide
to the Sav HppD Az1•Fe(II)•NO complex increased
the rhombicity (E/D) of the g ~ 4 signal
from 0.014 to ~0.017, and further addition
of 1NF continued the increase in the signal
rhombicity (E/D = 0.023). These observations
suggest that both N3⁻ and 1NF interact with
the Fe(II) center of Sav HppD Az1 (Fig. 4A,
fig. S7, and table S6). To demonstrate that an
Fe(III)-N3 species is involved in the reaction,
we incubated Sav HppD Az1•Fe(II)•N3 with an
N-fluoroamide 18NF that lacked the reactive 
benzylic C-H bonds. Slow accumulation of a 
red species was observed by optical absorption 
centered at 505 nm (Fig. 4B), which likely 
originated from the Fe(III)-N3 ligand-to-metal
charge transfer band (20-22). The EPR signal 
of this red species was located at g ~ 4.3, fur- 
ther confirming that the iron oxidation state 
of the formed species is in high spin (S = 5/2) 
Fe(III) (see section X of the supplementary 
materials). We also observed the formation of 
a minor stable organic radical centered at g = 2 
(scheme S6). Although further studies are needed 
to characterize this radical species, we speculated 
that it may be a secondary radical formed through 
quenching of the initial amidyl radical, as this 
g = 2 signal was not observed when incubating
Sav HppD Az1•Fe(II)•N3 with the model
N-fluoroamide substrate 1NF (scheme S6).
We also performed computational modeling
to understand the molecular basis of this re- 
action. Focusing on enantioselective variant 
Sav HppD Az2, MD simulations showed that 
V189A and P243G generated more space to 
accommodate iron-bound azide in the active 
site (Fig. 4C and figs. S8 to S12). In WT Sav 
HppD, N191, N245, and S230 participated in
a hydrogen bonding network with Q269 for 
native substrate positioning (23). Introduction 
of the mutations N191A, S230L, and P243G dis- 
rupted this network. These mutations, together 
with N245F and L367I, created a hydrophobic 
environment to accommodate N-fluoroamide 
substrate 1NF for N-F activation and position
the ethyl group of the substrate closer to the 
iron-bound azide in a restricted and preorganized 
conformation for the subsequent reaction 
steps (Fig. 4C and fig. S18). Model DFT calculations
(Fig. 4D) indicated that the initial N-F
activation step (ΔG‡(TS3-N1) = 17.2 kcal·mol⁻¹)
was rate limiting and was followed by
a fast 1,5-HAT (ΔG‡(TS2) = 3.9 kcal·mol⁻¹) to the
N-centered radical. This mechanistic scheme
was similar to that reported in an iron-catalyzed
fluoroamide-directed fluorination (24) and was
consistent with the absence of a kinetic isotope
effect, as determined by measuring independent
initial rates for reactions with 1NF and 1NF-d2
(fig. S4). The major conformation of substrate
1NF in Az2 preorganized the pro-S benzylic
C-H bond for HAT, with the benzylic carbon
projected in proximity to the iron-bound azide
(Fig. 4C and fig. S18). Therefore, after 1,5-HAT
the resulting C-radical was well positioned
for azide recombination in a stereo-retentive
manner with a low energy barrier (ΔG‡(TS3-N1)
= 4.4 kcal·mol⁻¹). Notably, although fluorine
transfer had an intrinsically low activation
barrier (ΔG‡ = 5.0 kcal·mol⁻¹, fig. S18), steric
constraints imposed within the active site 
likely prevented the substrate repositioning 
to enable fluorine recombination. Such control 
of radical rebound through substrate position- 
ing has also been observed in native reactions 
catalyzed by nonheme iron halogenases, in 
which the proximity of substrate radicals toward 
the iron-bound chloride facilitated C-Cl bond 
formation over C-OH bond formation (25, 26). 
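
To put the computed barriers on a common footing, the sketch below converts each activation free energy quoted in the text into a first-order rate constant with the Eyring equation, k = (kB·T/h)·exp(−ΔG‡/RT). The temperature of 298.15 K and a transmission coefficient of 1 are assumptions made for illustration, and the step labels are informal; the results are order-of-magnitude estimates, not values from the original calculations.

```python
import math

K_B = 1.380649e-23    # Boltzmann constant, J/K
H = 6.62607015e-34    # Planck constant, J*s
R_KCAL = 1.987204e-3  # molar gas constant, kcal/(mol*K)


def eyring_rate(dg_kcal_per_mol: float, temperature_k: float = 298.15) -> float:
    """Rate constant (s^-1) from an activation free energy, assuming kappa = 1."""
    prefactor = K_B * temperature_k / H
    return prefactor * math.exp(-dg_kcal_per_mol / (R_KCAL * temperature_k))


# Activation free energies (kcal/mol) quoted in the text.
barriers = {
    "N-F activation": 17.2,
    "1,5-HAT": 3.9,
    "azide rebound": 4.4,
    "fluorine rebound": 5.0,
}

for step, dg in barriers.items():
    print(f"{step:>16s}: dG = {dg:4.1f} kcal/mol -> k ~ {eyring_rate(dg):.2e} s^-1")
```

On these assumptions the N-F activation step is slower than the later steps by many orders of magnitude, whereas the azide and fluorine rebound barriers differ by less than a factor of three in rate, in line with the argument that substrate positioning, rather than intrinsic barrier height, governs the chemoselectivity.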

The biocatalytic system reported here performs
C(sp³)-H functionalization reactions
through a metal-catalyzed radical-relay mech- 
anism, thus expanding the scope of nonheme 
iron enzyme catalysis. We envision that the 
merger of various radical generation processes 
in synthetic chemistry and the capability of 
metalloenzymes for radical trapping will pro- 
vide a powerful and general strategy to advance 
the frontier of radical biocatalysis.

Optical interferometry-based array of seafloor environmental sensors using a transoceanic submarine cable

G. Marra*, D. M. Fairweather, V. Kamalov, P. Gaynor, M. Cantono, S. Mulholland, B. Baptie,
J. C. Castellanos, G. Vagenas, J.-O. Gaudron, J. Kronjäger, I. R. Hill, M. Schioppo,
I. Barbeito Edreira, K. A. Burrows, C. Clivati, D. Calonico, A. Curtis


Optical fiber-based sensing technology can drastically improve Earth observations by enabling the use of 
existing submarine communication cables as seafloor sensors. Previous interferometric and polarization- 
based techniques demonstrated environmental sensing over cable lengths up to 10,500 kilometers. 
However, measurements were limited to the integrated changes over the entire length of the cable. We 
demonstrate the detection of earthquakes and ocean signals on individual spans between repeaters
of a 5860-kilometer-long transatlantic cable rather than the whole cable. By applying this technique to
the existing undersea communication cables, which have a repeater-to-repeater span length of 45 to 
90 kilometers, the largely unmonitored ocean floor could be instrumented with thousands of permanent 
real-time environmental sensors without changes to the underwater infrastructure. 


Despite substantial expansion of sensing
capabilities, seas and oceans remain
mostly unmonitored, limiting our understanding
of Earth’s structure and its dynamic
behavior. However, research on
techniques exploiting the sensitivity of optical 
fiber to environmental perturbations over the 
past few years has shown that the existing net- 
work of submarine cables could potentially be 
used as seafloor sensors. While backscattering- 
based techniques, such as distributed acoustic 
sensing (DAS), provide high sensitivity and 
spatial resolution, such approaches are cur- 
rently limited to coastal areas up to 100 km 
from the shore owing to signal attenuation 
(1-4). Previously, we showed that optical in- 
terferometric techniques, which detect optical 
phase changes, provide a solution for monitoring
over much longer distances than the
working range of DAS, but at the expense of
a lower spatial resolution (5). More recently, 
Zhan et al. demonstrated that polarization 
of light can also be used for environmental 
sensing over submarine cables up to 10,500 km 
in length (6). However, in both techniques, the 
measured quantity is the cumulative phase or 
polarization change over the entire length of 
the optical fiber cable. Although the detection 
of earthquakes and ocean waves was success- 
fully demonstrated, measuring the integrated 
perturbations over the entire cable length 
has two major limitations. First, the measure- 
ment noise floor is set by the background 
environmental noise integrated over the en- 
tire cable length. This limits the detection of 
smaller-sized earthquakes and other environ- 
mental signals of interest. Second, because 
the cable acts as a single sensor, signals from 
multiple cables are required to triangulate 
the location of sources, such as earthquake 
epicenters (5). 
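
The first limitation can be made concrete with a simple noise budget. The sketch below assumes that the noise contributions of different spans are uncorrelated, so that their powers add; the span counts and noise powers are hypothetical values chosen only for illustration.

```python
import math


def span_snr_gain_db(span_noise_powers, span_index):
    """Power SNR gain (dB) of measuring one span alone versus the whole cable.

    Assumes the noise of different spans is uncorrelated, so powers add, and
    that the signal of interest is confined to the chosen span.
    """
    total = sum(span_noise_powers)
    return 10.0 * math.log10(total / span_noise_powers[span_index])


# Hypothetical cable: 100 spans with equal noise power (arbitrary units).
uniform = [1.0] * 100
print(f"uniform cable, any span: {span_snr_gain_db(uniform, 0):.1f} dB")

# Hypothetical cable in which one noisy span dominates the integrated noise.
uneven = [1.0] * 99 + [1000.0]
print(f"quiet span on uneven cable: {span_snr_gain_db(uneven, 0):.1f} dB")
print(f"noisy span on uneven cable: {span_snr_gain_db(uneven, 99):.1f} dB")
```

Under these assumptions a signal confined to a quiet span benefits most from not being summed with the noise of the rest of the cable.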

Our work overcomes the first limitation and 
substantially improves on the second by using 
an optical technique that enables the measurement
of environmentally induced optical
phase changes over individual sections of the
submarine cable rather than its entire length. 
In doing so, we demonstrate the detection of 
earthquakes, microseisms, and ocean currents 
spatially resolved at several locations along the 
cable. We show that the substantially higher 
sensitivity achieved by this interferometry- 
based technique enables the detection of signals 
that could not be observed when measuring 
the cumulative optical perturbation across the 
entire cable. Lastly, we show that, by perform- 
ing spanwise measurements, a single cable is 
sufficient to identify the epicentral region of 
a teleseismic earthquake. 

The cable sections correspond to one or 
more spans between the repeaters used for 
the amplification of the optical signal along 
the cable. The technique we used allows for the 
environmentally induced optical perturbations 
in each cable section to be measured indepen- 
dently, effectively converting a submarine 
cable into an array of fiber-based sensors. The 
number of sensors can be as high as the num- 
ber of repeater-to-repeater cable spans. For 
an intercontinental link, such as the one we 
used, the number of spans can exceed 100. 
The technique we present has the potential to 
transform our Earth-monitoring capabilities, 
as the currently greatly undersampled sea- 
floor could be instrumented with thousands 
of interferometry-based sensors without mod- 
ifying the existing submarine telecommunica- 
tion infrastructure. Crucially, although in our 
work we used a research-grade ultrastable 
laser, we show that more compact and lower 
cost narrow-linewidth telecommunications 
lasers could also be used with limited or no 
loss of sensitivity. This feature makes our 
technique highly scalable toward potential 
conversion of the existing subsea cable infra- 
structure into a global network of environmen- 
tal sensors. 

Fig. 1. Map and details of the UK-Canada submarine link. (A) Illustration of the high-loss loop back (HLLB) architecture and cable bathymetry. (B) Map of the
UK-Canada submarine cable. The actual number of repeaters (128) was reduced in the figure for illustration purposes. Each repeater has a pair of fiber Bragg
grating (FBG) reflectors.

We achieved an array of sensing elements by
exploiting the architecture of repeaters in modern
submarine cables. These repeaters often
include a high-loss loop back (HLLB) path that
enables a small fraction of the transmitted light
to be returned to the transmitting end via the
second fiber of the fiber pair (Fig. 1). On many
cables, the return path is implemented by
using fiber Bragg gratings (FBGs), acting
as narrow-band reflectors (fig. S1A). These
return paths are used by the cable operator 
to periodically check the health of the opti- 
cal amplifiers (7). These checks are typically 
performed on a schedule or if there is a mal- 
function, leaving these channels unused most 
of the time. By injecting infrared light from 
an ultrastable laser source and performing 
high-precision interferometric measurements 
of the signal returned by the HLLBs, we ex- 
tracted the round-trip (loop-back) accumulated 
optical phase changes in the fiber between the 
transmitter end of the cable and the chosen 
repeaters. To localize the perturbations to cable 
sections between two or multiple repeaters, 
we computed the difference between loop- 
back signals. As the wavelength of the FBG 
reflectors is the same for all HLLBs, we used 
an optical technique called optical frequency 
domain reflectometry (8), whereby the fre- 
quency of the light source is swept such that 
the loop-back signal from each repeater can be
separated in the frequency domain. This technique
is used in some Rayleigh scattering-based
fiber sensing applications (9). However,
in our test, we did not use the scattered light 
from the fiber inhomogeneities but rather the 
signal returned by the FBG-based reflector in 
the HLLBs. This resulted in a returned signal 
that was orders of magnitude higher in ampli- 
tude than from back-scattering. The frequency- 
swept laser light was injected into the forward 
path of the submarine cable. The light returned 
at the transmitter, which consisted of the over- 
lap of the returned light from each repeater, 
was combined on a photodetector with the 
output of the swept laser. This produced a series
of radio-frequency beat notes, each identifying a
different repeater, with a frequency proportional
to the round-trip time (fig. S2). Each loop-back
beat note was spectrally filtered and its phase, relative to that
of the local laser source, was measured with 
a phase meter. 
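
The two steps described above, separating the loop-back signals by beat frequency and differencing them to isolate a cable section, can be sketched as follows. This is an illustration under stated assumptions rather than the actual processing chain: it assumes a linear frequency sweep, a fiber group index of 1.468, a return path of the same length as the forward path, and a hypothetical sweep rate and set of repeater distances.

```python
C_VACUUM_M_PER_S = 299_792_458.0
GROUP_INDEX = 1.468  # assumed typical value for silica fiber


def round_trip_delay_s(distance_km: float) -> float:
    """Out-and-back propagation delay to a repeater at the given cable distance."""
    return 2.0 * distance_km * 1e3 * GROUP_INDEX / C_VACUUM_M_PER_S


def beat_frequency_hz(distance_km: float, sweep_rate_hz_per_s: float) -> float:
    """Beat note of a linearly swept laser against its delayed loop-back copy."""
    return sweep_rate_hz_per_s * round_trip_delay_s(distance_km)


def section_phases(loopback_phases_rad):
    """Difference consecutive cumulative loop-back phases to isolate the
    cable sections lying between the chosen repeaters."""
    return [
        later - earlier
        for earlier, later in zip(loopback_phases_rad, loopback_phases_rad[1:])
    ]


SWEEP_RATE_HZ_PER_S = 1.0e9  # hypothetical sweep rate
for distance_km in (46.0, 92.0, 138.0):  # hypothetical repeater positions
    f_beat = beat_frequency_hz(distance_km, SWEEP_RATE_HZ_PER_S)
    print(f"repeater at {distance_km:5.1f} km -> beat note at {f_beat / 1e3:.1f} kHz")

# Hypothetical cumulative phases (rad) measured at three consecutive repeaters.
print(section_phases([1.0, 1.5, 4.0]))
```

Because the beat frequency grows linearly with distance, equally spaced repeaters appear as equally spaced radio-frequency tones, which is what allows them to be filtered individually.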

We tested this technique on a 5860-km-long
intercontinental submarine optical fiber link
between the UK and Canada. The link consists
of two cables: a 248-km-long cable from
Southport, UK, to Dublin, Ireland, and a 
5612-km-long cable from Dublin to Halifax, 
Canada. A total of 128 optical repeaters are 
installed on the link, and the average span 
between them is 46 km. Our experimental 
setup allowed us to simultaneously measure
loop-back signals from up to 12 selectable
repeaters at a time. The hardware
upgrades required to extend the monitor- 
ing capability to the full number of spans be- 
tween repeaters are shown in fig. S3. Because 
of temperature-induced polarization changes 
in the cable, the signal-to-noise ratio (SNR) of 
some of the available loop-back signals could 
temporarily become insufficient for the fre- 
quency counter to perform a correct measure- 
ment. In our tests, on average, seven to nine 
loop-back signals exhibited sufficient SNR. 
Moreover, the SNR of the returned signal 
degraded because of the accumulation of 
amplified spontaneous emission noise of the 
optical amplifiers for repeaters increasingly
distant from the transmitting end. In our tests,
we obtained signals with sufficient SNR from
repeaters at a cable distance up to 3100 km. As
this is more than half of the cable length,
monitoring of all the submarine cable spans
could be achieved by also measuring from the
Canadian end. The polarization-induced
degradation of the signal can be eliminated, and
the range of usable loop-back signals extended,
with further upgrades
(10). We also successfully tested our technique
while data traffic was present on other channels
available in the fiber pair. This demonstrates
the compatibility of our technique with
standard optical communication technology,
which is a crucial prerequisite for its wide application
over the submarine telecommunication
network.

Fig. 2. Earthquake detections on submarine cable spans. (A) Time series
and spectrograms of the optical frequency deviation induced by the Northern Peru
earthquake. Vertical blue lines show the theoretical P wave arrival time.
(B) Spectrograms of the Flores Sea earthquake detected by the cable sections
and compared with their nearest seismic stations on land. The dashed black lines
on the spectrograms track the surface wave group velocity dispersion. As a reference,
we show the group velocity scale for station EI.DSB (Dublin, Ireland, Irish National
Seismic Network). EI.VAL, Valentia, Ireland, Irish National Seismic Network; PM.ROSA,
Rosais, Azores, Portugal, Portuguese National Seismic Network. (C) Map of the cable
sections used for data displayed in (A) and (B) and location of the seismometers
considered in (B). (D) Geographical area for the epicenter location for the Northern
Peru earthquake, obtained by back-projection. The heatmap represents the probability
of the epicenter location, with warm colors representing a higher probability. The
stars indicate the event epicenters, as reported by the Global Centroid Moment Tensor
earthquake catalog (19, 20), and the black dots show the locations of the six cable
sections used for the back-projection (10).

We show the detection of two earthquakes
on multiple submarine cable spans: the Northern
Peru moment magnitude (Mw) 7.5 earthquake
(28 November 2021) and the Flores
Sea Mw 7.3 earthquake (14 December 2021)
(Fig. 2, A and B). We detected
the Northern Peru earthquake on six of the
nine sections of the cable being tested. We 
plotted a spectrogram and time series of the 
signals detected by three of these sections 
(Fig. 2A). The signals and location of the 
additional six spans are shown in fig. S4. The 
spectrograms display the temporal evolution 
of the power spectral density (PSD) of the 
frequency fluctuations of the optical signal,
induced by environmental perturbations, over
the cable section that is being measured. For 
all the data shown in this work, we choose to 
display the optical frequency deviation rather 
than phase deviation, as we experimentally 
find that this allows us to better visualize 
notable features in the detected signals.

Fig. 3. Environmental noise detected by different sections of the trans-Atlantic cable. (A to F) Environmental noise detected on continental shelf, deep ocean, and
Mid-Atlantic Ridge sections of the cable. (G) Environmental noise detected on the full link. Data with insufficient SNR was removed. The range of the color bar of each spectrogram
is adjusted to best visualize the detected signals. The impulsive noise in the spectrograms corresponding to the time marked by the short horizontal orange lines at the
bottom of (F) is due to glitches caused by a temporary issue with the laser locking setup and not to environmental noise on the seafloor. D, depth.

Several factors can contribute to the amplitude
of the observed waveform, including the
earthquake fault geometry, the three-dimensional
heterogeneity of the seismic structure between
the fault and cable, the coupling of
the cable to the external environment, and the
length of the fiber. Further tests on a larger
number of spans and events will allow better 
characterization of the weight of each con- 
tribution. From the time lag between enve- 
lopes of the detected teleseismic P waves 
(Fig. 2A) and using a simple back-projection 
(BP) approach (10), we were able to resolve 
the epicentral location of the seismic event 
(Fig. 2D). 
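
The optical frequency deviation displayed throughout this work is related to the measured phase by a time derivative, Δν(t) = (1/2π)·dφ/dt. The sketch below shows one simple way to make that conversion for sampled data, using a first-order finite difference; the function name, sample rate, and test signal are hypothetical.

```python
import math


def frequency_deviation_hz(phase_rad, sample_rate_hz):
    """Finite-difference estimate of frequency deviation from sampled phase.

    Implements delta_nu = (1 / 2 pi) * d(phi)/dt with a first-order difference,
    so the output has one sample fewer than the input.
    """
    scale = sample_rate_hz / (2.0 * math.pi)
    return [(later - earlier) * scale for earlier, later in zip(phase_rad, phase_rad[1:])]


# Hypothetical test signal: a phase ramp equivalent to a constant 3 Hz offset,
# sampled at 100 Hz.
SAMPLE_RATE_HZ = 100.0
ramp = [2.0 * math.pi * 3.0 * k / SAMPLE_RATE_HZ for k in range(6)]
print(frequency_deviation_hz(ramp, SAMPLE_RATE_HZ))
```

Differentiation multiplies the power spectral density by the square of the Fourier frequency, so the frequency representation gives relatively more weight to faster fluctuations than the phase representation does.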

The Flores Sea earthquake was detected on
five of the nine cable sections being tested (fig.
S7). The spectrogram of the optical frequency
deviations (Fig. 2B) induced by the seismic
wave on the S1, S2, and S5 cable sections
(Fig. 2C) is shown in comparison with the
closest available seismic station on land. Although
the P wave arrival is not visible on the
span data, the surface wave dispersion is clearly
resolved and shows very good agreement with
that measured by land seismometers. In addition
to the measurements on separate sections
of the cable, we also measured the optical
frequency changes on the entire UK-Canada 
cable on a separate channel of the same fiber 
pair. We performed this measurement by con- 
necting the fiber pair together at the Halifax 
end, such that we obtained a return signal in 
Southport after a 11,720-km-long round trip. 
We stress that in this arrangement the signal 
does not go through the HLLBs but forward- 
propagates for the entire length of the cable, as 
in our previous work (5). Whereas the surface 
wave of the Flores Sea earthquake was also 
visible in the full-cable measurement (fig. S7),
the Northern Peru earthquake was not detected 
(fig. S4). This is because the detected surface 
waves from the Flores Sea signal had compo- 
nents between 0.01 and 0.1 Hz, which we 
measured to be a relatively low noise frequency 
range on the seafloor, whereas the detected 
body waves from the Northern Peru signal 
had components in the 0.1 to 5 Hz range,
which is considerably noisier. The detection
of the Northern Peru earthquake therefore 
demonstrates the substantially better detec- 
tion threshold achieved with spanwise mea- 
surements, as the background environmental 
noise on the other parts of the cable does not 
contribute to the measurement noise floor.

We show the spectrogram of the environmental
noise measured by different sections
of the intercontinental submarine cable from 
the cable landing station (CLS) in South- 
port to the Mid-Atlantic Ridge (MAR) on 
21 October 2021 (Fig. 3). On the first 269-km 
cable section, which crosses the Irish Sea floor, 
we observed narrow-band components in the 
1 to 5 Hz range that repeated approximately 
every 6 hours (Fig. 3A). This periodic pattern is 
synchronous with tidal currents in the Irish 
Sea, and their amplitude correlates well with 
the neap/spring cycle, as shown by a longer 
dataset in Fig. 4A. We speculate that the 
observed signals arise from “strumming” of 
exposed cable sections or pressure changes
induced by the tidal currents. We also observed
these periodic signals on the second section
close to the southeastern coast of Ireland (Fig.
3B), although they were less pronounced, together
with a strong microseismic component in the 0.07
to 0.15 Hz range. The latter feature was likely
caused by the coupling of ocean wave energy
to the shallow seafloor.

Fig. 4. Correlation of detected signals on cable spans with wave height
and tidal currents. (A) Spectrogram of the frequency changes detected on the
span shown in (C) compared with wave height measured at the M2 buoy in
the Irish Sea and tidal current velocity measured by the Dublin Port tide gauge,
showing a high degree of correlation. (B) Storm detected on the span shown
in (D) in late November 2021 and correlation with water levels and tidal current
velocity measured in the Irish Sea. (C and D) Maps of the cable sections
used and M2 buoy. (E) Comparison between the power spectral density of
environmental signals (earthquake and currents) and the calculated measurement
noise floor that would be obtained if an off-the-shelf telecommunication
narrow-linewidth laser, instead of research-grade laser, were used on a
46-km and 90-km span (10).

On the cable section that extends from the 
Irish coast to the end of the continental shelf, 
where the bathymetry is consistently flat, we 
observed a drop in the environmental noise 
level of up to 30 dB (Fig. 3C and fig. S8). On
the fourth section (Fig. 3D), extending from 
the end of the continental shelf to the MAR 
area, we detected excess noise between 0.5 
and 1.5 Hz, this time with a 12-hour repeat- 
ing pattern, which we attribute to tidal cur- 
rents in the MAR area. We also observed a 
large increase in the detected noise as the 
cable approaches and crosses the MAR (Fig. 3,
E and F). We attribute this to the interaction
of ocean currents with the changing MAR 
bathymetry. Around this area, sea mounts 
rise from 3740 to 2330 m water depth, with 
the cable possibly being suspended between 
irregularities of the seafloor and thus more 
subject to current-induced movement. By comparing
Fig. 3F and Fig. 3G, we can identify
several features in the spectrogram of the
integrated round-trip signal over the entire
Halifax-Southport cable (Fig. 3G) that also
appear in that of the cable section across the MAR (Fig. 3F),
indicating that this area is a large contributor
to the overall measured noise. We also
observed a high level of correlation of the 
periodic signals observed on the shallow water 
sections of the cable with the tidal current 
velocity (Fig. 4A). We also found a high level 
of correlation between the signals in the 0.1 
to 0.5 Hz range and the wave height mea- 
sured at the M2 buoy in the Irish Sea (Fig. 4A). 
We identified Hurricane Larry to be the like- 
ly source of the dispersive microseism in the 
0.05 to 0.1 Hz frequency range between 13 
and 17 September (10). We show an 8-day-long
dataset in late November 2021 (Fig. 4B), on 
the cable section shown in Fig. 4D. During 
this time, an extratropical cyclone moved 
southward across the UK, and a resulting 
strong microseismic component was ob- 
served on the Southport-Dublin section of 
the cable. This correlates very well with the 
increase of average wave height measured 
at the M2 buoy. 

Crucially, in contrast to previous optical in- 
terferometric tests, which required a research- 
grade ultrastable laser (5), the technique we 
have demonstrated here greatly relaxes the 
requirements on the laser source. Because 
the optical perturbations are measured over 
sections of the cable rather than the whole 
length, the requirement for the laser coher- 
ence length is scaled accordingly (10). For 
the submarine cable used in these tests, the 
coherence length requirement is reduced by 
approximately a factor of 129, equating to 
the number of ~46-km-long spans between 
repeaters and to the CLSs. Although in our 
experiment we still used a research-grade 
stabilized laser developed at NPL, we show 
that commercially available narrow-linewidth 
lasers could be used instead to detect most of 
the signals we have presented. An example is 
in the PSD of the Mw 7.5 Peru earthquake 
(Fig. 4E) detected on the 95-km-long S3 span 
of Fig. 2C. For comparison, the contribution 
of the laser instability of a narrow-linewidth 
telecommunication laser to the measurement 
noise floor over the same span length is dis- 
played. We also show the PSD of the current- 
induced noise measured over the 92-km-long 
S4 span west of MAR, which is up to 30 dB 
above the expected laser contribution to the 
noise floor. 
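
The scaling argument above can be illustrated with a few lines of arithmetic. This is a minimal sketch that assumes, as a simplification, that the coherence-length requirement scales linearly with the length of fiber being interrogated. The span count (129) and nominal span length (~46 km) come from the text; the function name and the linear-scaling assumption are illustrative only.

```python
def coherence_relaxation(n_spans: int, span_km: float) -> dict:
    """Compare whole-cable and per-span interrogation lengths.

    Assumption (illustrative): the required laser coherence length scales
    linearly with the length of fiber over which the signal is measured.
    """
    total_km = n_spans * span_km  # approximate full cable length
    return {
        "total_km": total_km,
        "span_km": span_km,
        # Ratio of whole-cable to single-span length; equals n_spans.
        "relaxation_factor": total_km / span_km,
    }


result = coherence_relaxation(n_spans=129, span_km=46.0)
print(result)
```

Under this assumption the relaxation factor is simply the number of spans, which is why measuring span by span rather than over the whole cable eases the demands on the laser.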

By enabling the use of commercially availa- 
ble telecommunication lasers, our technique 
could easily be scaled up and applied to a large 
number of cables in the existing submarine 
network. By converting submarine cables into 
arrays of environmental sensors, a large net- 
work of hundreds or thousands of perma- 
nent and real-time seafloor sensors could 
be implemented without modification of the 
existing subsea infrastructure. This has the 
potential to transform our understanding 
of both shallow and deep processes inside 
Earth. The ability to record seismic phases 
in the middle of ocean basins could enable 
the imaging of previously obscured structures 
such as mid-ocean ridges and oceanic fault 
zones, advancing our understanding of the 
processes that underlie the generation of 
oceanic crust (11) and the mechanisms by 
which oceanic plates are hydrated (12). 

The cable sensitivity to water currents could 
be explored to improve our understanding of 
deepwater flows (thermohaline circulation), 
including the proposed slowing down of ocean 
currents due to rising global temperatures 
(13, 14). Furthermore, the joint detection of 
deepwater currents and seafloor pressure 
changes caused by the passing of ocean waves 
implies strong potential for tsunami sensing 
(15, 16). Lastly, while more research is needed 
to characterize long-term measurement drifts 
(10), the sensitivity of the optical cables to 
temperature could be explored for climate 
change research (17, 18). 
Universal antigen encoding of T cell activation from 
high-dimensional cytokine dynamics 


Sooraj R. Achar, François X. P. Bourassa, Thomas J. Rademaker, Angela Lee, Taisuke Kondo, 
Emanuel Salazar-Cavazos, John S. Davies, Naomi Taylor, Paul François, Grégoire Altan-Bonnet 


Systems immunology lacks a framework with which to derive theoretical understanding from high- 
dimensional datasets. We combined a robotic platform with machine learning to experimentally 
measure and theoretically model CD8⁺ T cell activation. High-dimensional cytokine dynamics could be 
compressed onto a low-dimensional latent space in an antigen-specific manner (so-called “antigen 
encoding”). We used antigen encoding to model and reconstruct patterns of T cell immune activation. 
The model delineated six classes of antigens eliciting distinct T cell responses. We generalized 
antigen encoding to multiple immune settings, including drug perturbations and activation of chimeric 
antigen receptor T cells. Such universal antigen encoding for T cell activation may enable further 
modeling of immune responses and their rational manipulation to optimize immunotherapies. 


The design of immunotherapeutic tools may be enhanced by quantitative 
approaches tackling multidimensional immune responses (1). Although 
high-dimensional snapshots of immune cells are routinely acquired (2-4), 
they are often ill suited to elucidate the time-dependent processes of 
immune responses. For instance, expression of the Programmed Cell Death-1 
(PD-1) protein correlates with either T cell activation or exhaustion 
depending on the timing and context (4). Such complexity stems from the 
multivariate responses triggered by multiple ligands (antigens) engaging a 
monogenic T cell receptor (TCR) (Fig. 1A). Downstream responses drive 
distinct patterns of activation dynamics encoding different stimuli using 
feed-forward processing (5, 6). 

Fig. 1. Robotic platform to quantify the dynamics of antigen-driven 
activation of T cells. (A) The TCRαβ signaling machinery of T cells engages 
a repertoire of peptide–major histocompatibility complex (pMHC) ligands 
(antigens), triggers a signal transduction cascade, and activates a 
functional response (e.g., cytokine secretion). MAPK, mitogen-activated 
protein kinase; PI3K, phosphatidylinositol 3-kinase; JAK, Janus kinase; 
STAT, signal transducer and activator of transcription; IFN-γ, interferon-γ; 
IL, interleukin; TNF, tumor necrosis factor. (B) T cells must deconvolve 
quality and quantity to accurately discriminate between antigens. 
(C) Robotic platform to multiplex immunological settings and dynamic 
measurements (e.g., cytokines). This platform generates large datasets 
quantifying the inflammatory outputs generated by T cell activation ex vivo. 
We used a custom-designed pipeline to compile high-throughput bulk cytokine, 
surface marker, and single-cell data (see the supplementary materials, 
section 2.1). (D) Single readouts (here IL-2 and IFN-γ) of OT-1 T cell 
activation at varied time points for different antigenic peptides 
[SIIVFEKL(V4), SIITFEKL(T4), SIIQFEKL(Q4), and SIINFEKL(N4)] (left) failed 
to deconvolve antigen quality and quantity no matter the time point 
(center). This resulted in partial overlap between the amount of secreted 
cytokines depending on antigen quality and complete confusion depending on 
antigen quantity (right). (E) Information between antigen quality and 
secreted cytokines combined versus individually. 

We set out to robotically generate high- 
precision, multidimensional kinetic data of 
T cell activation, and applied supervised 
machine-learning approaches (7) to learn a 
dynamic encoding of information (8, 9). We 
focused on modeling the responses of T cells to 
antigens on target cells. Current understand- 
ing of T cell responses defines three functional 
classes of antigen, non-agonists, weak agonists, 
and strong agonists, on the basis of their impact 
on thymic selection: death by neglect, positive 
selection, and negative selection, respectively 
(10). This observation led to the concept of 
antigen quality, which has proven useful in 
predicting successful eradication of viruses 
or tumors but has been difficult to predict 
from antigen sequences (11). Estimating antigen 
quality usually involves measuring the 
antigen levels triggering 50% of the maximal 
immune response (EC₅₀) (10-12). However, 
antigen quality should be defined as an ab- 
solute property because higher quantities of 
weak antigens do not functionally match low 
quantities of strong antigens (Fig. 1B) (13). 
We measured the kinetics of multiple cyto- 
kines during T cell activation by different 
quantities and qualities of antigen (i.e., differ- 
ent peptide sequences). We built a robotic 
cell culture platform combined with a high- 
throughput data-processing pipeline (14) to 
multiplex automatic measurements of cyto- 
kine concentrations and single-cell activation 
markers over multiple time points (Fig. 1C and 
fig. S1). We tested different antigens (SIINFEKL 
peptide and its variants) recognized by the 
mouse ovalbumin-specific (OT-1) TCR on CD8⁺ 
T cells (10, 12) and found diverse yet over- 
lapping patterns of cellular activation and 
cytokine secretion (Fig. 1D and figs. S2 and S3). 
Using mutual information (15-19), we quan- 
tified how well the cytokine dynamics at 
individual time points correlated to antigen 
quality independently of quantity and found 
that over time, these dynamics carried subs- 
tantially higher information than individual 
cytokines at a given time point (Fig. 1E and 
supplementary materials, section 6). 
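
To make the mutual-information comparison concrete, here is a minimal sketch of a plug-in estimate of mutual information between two discrete variables, such as an antigen label and a binned cytokine readout. The toy labels and bins are hypothetical, and this simple estimator is not the authors' procedure (described in their supplementary materials, section 6); plug-in estimates are also biased when samples are few.

```python
import math
from collections import Counter


def mutual_information_bits(xs, ys):
    """Plug-in estimate of I(X;Y) in bits from paired discrete samples."""
    n = len(xs)
    if n == 0 or n != len(ys):
        raise ValueError("xs and ys must be non-empty and of equal length")
    joint = Counter(zip(xs, ys))  # counts of each (x, y) pair
    px = Counter(xs)              # marginal counts of x
    py = Counter(ys)              # marginal counts of y
    mi = 0.0
    for (x, y), c in joint.items():
        # p(x,y) / (p(x) p(y)) simplifies to c * n / (count_x * count_y)
        mi += (c / n) * math.log2(c * n / (px[x] * py[y]))
    return mi


# Hypothetical toy data: antigen labels and a binned cytokine level.
antigen = ["N4", "N4", "T4", "T4"]
informative = ["high", "high", "low", "low"]    # tracks the antigen
uninformative = ["high", "low", "high", "low"]  # independent of antigen

mi_informative = mutual_information_bits(antigen, informative)
mi_uninformative = mutual_information_bits(antigen, uninformative)
print(mi_informative, mi_uninformative)
```

A readout that perfectly separates two equally likely antigens carries 1 bit, whereas a readout that is independent of the antigen carries none.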

To help explain immune activation, we ana- 
lyzed and classified these complex dynamics 
with machine learning. Using the cumulative 
integral of the log-transformed concentrations 
of cytokines over time (fig. S6B and movie S1), 
we trained a neural network that performed 
supervised classification of cytokine dynamics 
according to antigen sequences irrespective of 
antigen quantity (Fig. 2, A and B, and fig. S8). 
We used a learning architecture with a two- 
node bottleneck for classification to match 
our estimate of the dimensionality of our 
datasets (fig. S6C). This projected the data 
onto a latent space where, after training, the 
five-dimensional (5D) cytokine time courses 
were compressed into 2D trajectories (Fig. 2C). 
The 2D trajectories separated well according 
to antigen sequences and were independent of 
antigen quantity over three orders of magnitude 
(Fig. 2, A and C). We refer to such well-separated 
projections in latent space as “antigen 
encoding” to emphasize how T cell responses 
are classifiable according to antigen quality 
(as determined by antigen sequences). This 
encoding reliably distinguished antigens even 
at high doses, when other markers typically 
saturate (Fig. 1D and fig. S11C). 
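
The preprocessing step described above, the cumulative integral of log-transformed cytokine concentrations, can be sketched as follows. This is an illustration under stated assumptions, not the authors' pipeline: the trapezoidal rule, the `floor_pM` detection floor used to avoid taking the logarithm of zero, and the example values are all hypothetical choices.

```python
import math


def cumulative_log_integral(times_h, conc_pM, floor_pM=1.0):
    """Cumulative trapezoidal integral of log10(conc / floor) over time.

    Concentrations below `floor_pM` (a hypothetical detection floor) are
    clipped to the floor so that their log-transformed value is zero.
    """
    if len(times_h) != len(conc_pM) or len(times_h) < 2:
        raise ValueError("need at least two paired time/concentration points")
    logs = [math.log10(max(c, floor_pM) / floor_pM) for c in conc_pM]
    integral = [0.0]
    for i in range(1, len(times_h)):
        dt = times_h[i] - times_h[i - 1]
        # Trapezoid area between consecutive time points.
        integral.append(integral[-1] + 0.5 * (logs[i] + logs[i - 1]) * dt)
    return integral


# Hypothetical time course for one cytokine (hours, picomolar).
times = [0.0, 24.0, 48.0, 72.0]
il2 = [1.0, 10.0, 100.0, 100.0]
features = cumulative_log_integral(times, il2)
print(features)
```

Applying such a transform to each of the five cytokines would give one five-dimensional time course per condition, which is the kind of input a classifier with a two-node bottleneck could then compress.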

We tested whether 2D antigen encoding pre- 
served the information of the high-dimensional 
kinetics of the system. We built a nonlinear 
decoder to reconstruct the entire 5D cytokine 
dynamics (Fig. 2, D to F, and supplementary 
materials, section 5) from the 2D latent space, 
demonstrating that essentially no information 
was lost. To further test our decoder, we used 
two antigens that were not included in the 
initial model training. We compressed the 5D 
cytokine dynamics spurred by these antigens 
through our trained network and used the 2D 
latent space trajectories as inputs to recon- 
struct the 5D raw data (Fig. 2, E and F). In this 
way, we could map high-dimensional cytokine 
data onto our low-dimensional latent space. 

We turned to mathematical modeling to 
derive more biological insight about antigen 
encoding. We fitted the cytokine trajectories 
in the latent space with four-parameter ballistic 


Fig. 2. Antigen encoding from a neural network that compresses the dynamics 
of T cell activation according to quality independently of quantity. 
(A and B) Dynamics of five cytokines for different quality and quantity of 
OT-1 antigens (left) are preprocessed (A) and classified (B) according to 
antigen sequences through a trained three-layer neural net with a 2D 
bottleneck (LS₁,LS₂). (C) Time courses of compressed cytokine dynamics in 
the latent space stratify T cell activation according to antigen sequence 
(quality) independently of antigen quantity (see movie S1 for data 
animation). Color and style for each line is the same as in (A). 
(D) Examples of latent space trajectories for different antigen qualities. 
(E and F) Nonlinear reconstruction of cytokine time series from latent 
space trajectories yielded an accurate fit with experimental data 
(fig. S19) (E), demonstrating completeness of antigen encoding [four 
peptide antigens are introduced here: EIINFEKL(E1), SIIGFEKL(G4), 
SIYNFEKL(Y3), and SAINFEKL(A2)] (F). 
Fig. 3. Quantifying the classes of T cell activation using antigen encoding in latent space. (A) Cytokine 
trajectories in latent space can be fitted with a four-parameter ballistic model. (B) Although one parameter 
(vₜ) is constant, the three others (v₀, θ, and t₀) are strongly correlated. a. u., arbitrary units. (C) Distribution 
of EC₅₀ (left) that maximizes the mutual information extracted from model parameters (we estimate a channel 
capacity of ≈2.6 bits; supplementary materials, section 6). By evenly sampling the cumulative distribution 
function (CDF, center), we found 2^2.6 ≈ 6 classes of antigen with nonoverlapping latent space trajectories 
(right). The arrow connects the 36-hour coordinates of trajectories of increasing antigen EC₅₀. (D) Latent space 
coordinates (LS₁,LS₂) at 36 hours as a function of antigen quality as defined by EC₅₀. (E) Sketch of the 
biochemical mechanisms governing antigen discrimination as suggested from the latent space coordinates in 
(D). Each class of antigen activates positive and negative signals differentially and yields varied patterns 
of cytokine secretion computed from the decoder (Fig. 2E) and trajectories in (C). 


equations comprising an initial activation 
characterized by an initial speed v₀, followed 
by a decay (supplementary materials, section 4; 
Fig. 3A; and movie S1). We discovered strong 
correlations between the fitted parameters 
(Fig. 3B) and the EC₅₀ measure of antigen 
quality (fig. S14B). This demonstrated that 
one parameter (e.g., v₀) was sufficient to 
describe each trajectory, thus defining antigen 
quality through a continuum of dynamics in 
the latent space. Ultimately, we obtained a 
generative model of cytokine dynamics by 
combining our nonlinear decoder (Fig. 2, D 
and E) with our one-parameter model (figs. 
S21 and S22). 

Such parametrization of antigen encoding 
allowed us to apply information theory (15-19) 
to estimate the number of antigen classes that 
T cells can distinguish. We measured the var- 
iability of cytokine kinetics across many rep- 
licates and used our continuous model to 
estimate a theoretical limit of 2.6 bits of in- 
formation transduced by the TCR pathway 
(figs. S23 to S26 and supplementary materials, 
sections 4 and 6). This defined 2^2.6 ≈ 6 
classes of antigens associated with distinct 
EC₅₀ and cytokine dynamics evenly distributed 
in the latent space (Fig. 3C). Therefore, learn- 
ing to classify patterns of TCR activation asso- 
ciated with discrete antigen sequences allowed 
us to build a continuous model of antigenicity 
subdivisible in at least six nonoverlapping 
classes compared with the previously de- 
scribed three canonical discrete classes. 
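
The step from an information limit to a number of classes is a short calculation: a channel that transmits C bits can distinguish at most about 2^C inputs. The sketch below works through it for the 2.6-bit estimate quoted in the text. The helper that places evenly spaced levels on a cumulative distribution is a hypothetical illustration of "evenly sampling" (midpoint quantiles are our assumption), not the authors' exact procedure.

```python
def distinguishable_classes(capacity_bits: float) -> float:
    """Effective number of distinguishable inputs for a given capacity."""
    return 2.0 ** capacity_bits


def evenly_spaced_quantiles(n_classes: int) -> list[float]:
    """Midpoint quantile levels in (0, 1), one per class (assumed scheme)."""
    return [(k + 0.5) / n_classes for k in range(n_classes)]


n_effective = distinguishable_classes(2.6)  # a little above 6
n_classes = round(n_effective)
levels = evenly_spaced_quantiles(n_classes)
print(n_effective, n_classes, levels)
```

Each quantile level would then be mapped back through the inverse cumulative distribution of antigen quality to obtain one representative antigen strength per class.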


The latent space also captured key modal- 
ities in terms of biological regulation. The 
latent space coordinates (LS₁,LS₂) at t = 36 hours varied 
as a function of antigen quality (EC₅₀) (Fig. 3, 
C to E). Whereas LS₁ increased monotonically, 
LS₂ had an inverted bell-shaped dependency. 
The “decoding” weights in the second layer 
of the neural network lined up with these 
dependencies, as expected for antigen classi- 
fication (fig. S27). Thus, the latent space can be 
interpreted as a competition between mono- 
tonic and non-monotonic pathways, consistent 
with the adaptive kinetic proofreading (AKPR) 
model that accounts for T cell antigen dis- 
crimination (13) and with information opti- 
mization in genetic networks (20). 

We tested antigen encoding in diverse immune 
settings. We first plotted the parameter 
v₀ (Fig. 3A) as a function of EC₅₀ for both our 
training and test sets and recovered a log-linear 
correlation above a minimum threshold 
(Fig. 4A). Our model of cytokine dynamics 
yielded a measure of antigen quality, v₀, as 
faithful as an EC₅₀ derived from a functional 
dose response. We further confirmed the predictive 
power of v₀ on the following additional 
immunological settings: (i) two states of 
differentiation of OT-1 T cells (naive or 
pre-activated effector; fig. S28); (ii) different 
antigen-presenting cells (splenocytes, bone 
marrow-derived macrophages, and dendritic 
cells) pulsed with peptides (fig. S29); (iii) different 
mouse TCR-transgenic T cells (Fig. 4B 
and fig. S12); (iv) human T cells expressing a 
TCR transgene (Fig. 4C and fig. S30); and (v) 
different tumors constitutively expressing 
antigens (Fig. 4D and fig. S31). Although the 
latent space may rotate (angle θ; Fig. 3A) 
according to immune contexts, e.g., when 
varying the number of T cells (Fig. 4E and fig. 
S32), in all settings, the structure of the latent 
space was preserved and v₀ accurately ranked 
antigens according to their quality. Overall, 
our model generalized well to infer antigen 
quality from unseen patterns of immune 
activation. 
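
A log-linear correlation of the kind described above means that the fitted initial speed varies linearly with the logarithm of the antigen's half-maximal dose. The sketch below fits such a line by ordinary least squares. The numbers are invented solely to keep the arithmetic easy to follow and are not measurements from the study; the text also notes that the relation holds only above a minimum threshold, which this toy fit ignores.

```python
import math


def fit_log_linear(ec50_molar, v0):
    """Least-squares fit of v0 = slope * log10(EC50) + intercept."""
    if len(ec50_molar) != len(v0) or len(v0) < 2:
        raise ValueError("need at least two paired measurements")
    xs = [math.log10(e) for e in ec50_molar]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(v0) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    if sxx == 0.0:
        raise ValueError("EC50 values must not all be identical")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, v0))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return slope, intercept


# Hypothetical values: stronger antigens (lower EC50) get a larger speed.
ec50 = [1e-12, 1e-11, 1e-10, 1e-9]
speed = [4.0, 3.0, 2.0, 1.0]
slope, intercept = fit_log_linear(ec50, speed)
print(f"slope={slope:.2f}, intercept={intercept:.2f}")
```

In this toy example each tenfold increase in the half-maximal dose lowers the fitted speed by one unit, which is the signature of a log-linear relation.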

We tested whether molecular perturbations 
(small drug inhibitors, cytokines or antibody 
blocking) could generate new cytokine dynam- 
ics. For most perturbations, the strong correla- 
tion between parameters of antigen encoding 
remained intact, and antigen hierarchy was 
preserved, albeit with a downshift in appar- 
ent antigen quality. However, two types of 
perturbation (activation of innate signaling 
and inhibition of cytokine signaling) maxi- 
mally broke this parameter correlation and 
generated distinct dynamic patterns, i.e., new 
classes of T cell activation with potentially 
new functions (Fig. 4F and fig. S33). 

We also investigated T cells genetically 
engineered to express a chimeric antigen 
receptor (CAR) against ligands expressed on 
B cell leukemia (CD19), as used in patients 
Fig. 4. Antigen encoding to explore diverse immunological settings. 
(A) Testing the ballistic model of the latent space on all OT-1 antigens yielded 
measurements of v₀ that correlated with their rank (as defined by the EC₅₀ 
of a functional dose response; fig. S12). (B to D) Antigen encoding learned 
from naive OT-1 T cells yields accurate v₀ ranking with different mouse 
TCR-transgenic T cells (B), with human TCR-transgenic T cells (C), and with 
OT-1 T cells responding to tumors constitutively expressing antigens (D). 
(E) Antigen encoding captures dependency on T cell numbers in (v₀,θ) space. 
(F) Left: antigen encoding (v₀,θ) of T cell activation under various molecular 
perturbations. Right: Earth mover's distance (EMD) for individual antigens. PC1 
captures the variation in antigen quality (EMD2 = 0). Although most molecular 
perturbations simply downgraded the apparent antigenicity (EMD1 < 0, 
EMD2 = 0), JAK inhibitors (AZD1480 and ruxolitinib) severely affected T cell 
activation (EMD2 < 0) and the TLR7/8 agonist resiquimod boosted it (EMD2 > 0). 
(G) Antigen encoding of anti-CD19 CAR-T cells (left) revealed a pattern of weak 
activation and antagonism for cytokine responses (right). 


with relapsed lymphoma (21). The dynamics 
of cytokines released and consumed by CAR- 
T cells were similar to the trajectories derived 
from natural T cells (Fig. 4G, left). CAR-T cells 
responding to CD19 yielded cytokine dynamics 
similar to the OT-1 TCR responding to weak 
agonists such as V4 (Fig. 4G, right), so signaling 
through CAR may only suboptimally activate 
T cells. We therefore tested the response of 
CAR-OT-1 T cells to a mixture of CAR and TCR 


ligands. Our latent space parametrization then 
revealed highly nonlinear, hierarchical effects: 
Weak OT-1 ligands (E1) antagonized CD19, 
whereas stronger OT-1 ligands dominated 
the response to CD19. This is consistent with 
the AKPR mechanism for T cell signaling with 
differential effects on positive and negative 
feedbacks (Fig. 3E) (13). Complex antagonism 
and synergism by weak TCR ligands highlight 
the importance of fine-tuning CAR-T cells for 
immunotherapeutic optimization. 

Our approach yielded large, multiplexed 
kinetic measurements and a quantitative mod- 
el of universal antigen encoding during T cell 
activation. Whereas classical approaches have 
qualitatively defined three discrete antigen 
categories, we rigorously derived a continu- 
ous parametrization of antigen quality. We 
applied information theory to quantify and 
interpret the phenotypic diversity of T cell 
activation beyond functional assays taken 
at a single time point. The universality of the 
model should prove useful for quantifying and 
fine-tuning the strength of antigens for immuno- 
therapies and vaccine development (7). Thus, 
robotic platforms combined with machine 
learning and mathematical modeling provide 
quantitative tools with which to study high- 
dimensional biological dynamics and may help 
to optimize therapeutic strategies. 
Don’t panic 

I was interviewing for the Ph.D. program of my dreams when my potential adviser invited me to 
look at a sample under the microscope. I meant to grab the eyepiece, but my maneuver went 
spectacularly wrong when my finger got caught between the two middle buttons of his shirt. My 
hasty attempt to remove it only made things worse, as the shirt popped open all the way down 
and even his undershirt pulled out of the waist of his pants. Even now, 15 years later, just writing 
about it is incredibly embarrassing. But my ability to not panic in that moment landed me the 
position, and became one of my greatest life lessons. 


It was the second phase of the se- 
lection process for an international 
Ph.D. program. Coming from the 
war-torn country of ex-Yugoslavia, 
I had passion for research but zero 
experience. The 2-day interview 
got off to a rocky start when an- 
other professor—the one I most 
wanted to work with—was so im- 
pressed by my competitor’s pre- 
sentation he gave her a high-five. I 
went to the bathroom to cry, then 
reminded myself there was more 
than one spot to be filled, washed 
my face, and headed to the net- 
working dinner. There, my future 
adviser found me. “It’s brave of 
you to apply with such a track re- 
cord,” he said. “Don’t miss visiting 
my lab tomorrow.” I wasn’t sure 
whether to be flattered or insulted, 
but I knew I needed to impress him 
to have a chance at the program. 

The next day, things seemed to go 
from bad to worse. The shuttle that was supposed to take me 
to the lab never showed up. I desperately hailed a taxi, but 
was stymied by a language barrier until I thought to show the 
driver the hospital’s logo on my paperwork. After he dropped 
me off, I had to find my way through a block of 40 hospital 
buildings. When I finally entered the room full of my compet- 
itors, with large sweat circles under my armpits and my face 
like a swollen tomato, the professor seemed amused. That’s 
when he invited me to the microscope and I “undressed him,” 
as the story was retold by those who witnessed the scene. 

The rest of the group was frozen with embarrassment, 
but I managed to keep my composure. Compared with what 
I had seen growing up—bombings, widespread corruption, 
catastrophic inflation—this seemed like a relatively minor 
disruption. I took charge of the situation. “I’m so sorry,” I 
said, then turned to the science. “There is a nonuniformly 
stained part of the cell. Which organelle is it?” 


Years later, I asked my adviser 
why he selected me for the position, 
even though my track record was 
weak and I had made a total fool 
of myself. “Oh, that’s simple,” he re- 
plied. “I was looking for a dedicated 
troubleshooter who won’t freak out 
at the time of crisis. And this is exactly 
what you demonstrated during 
the interview.” At that moment, 
I realized that maybe the apparent 
setbacks during the interview hap- 
pened not to me but for me, be- 
cause otherwise I would have had 
no chance to stand out. 

Looking back, I see my whole 
life as a series of such experiences. 
I was denied Ph.D. funding in my 
home country as its institutions 
crumbled. My adviser relocated 
to another country in the middle 
of my Ph.D. My application for 
a prestigious grant was rejected 
when I was already stretched thin 
maintaining my career after my second child was born. 
But when confronted with challenges, I have managed to 
keep a cool head and stay positive—an approach that has 
helped me find my way through, all the way to the tenure- 
track position I recently secured. 

My intensely difficult early experiences amid political and 
economic turmoil helped me develop resilience in the heat of 
crisis. Bad things happen, and not everything has a silver lin- 
ing. But I’ve learned to accept what I can’t control and take 
charge of what I can. I certainly don’t have it all figured out. 
But I know that, whatever I am going through, and however 
undeserved and devastating it feels, it helps tremendously to 
ask myself a single question: What if this is happening not to 
me, but for me? 


Senka Holzer is a group leader at the Medical University of Graz. 